HCV genomic sequences for diagnostics and therapeutics

ABSTRACT

The present application features nucleic acid, peptide and antibody compositions relating to genotypes of hepatitis C virus and methods of using such compositions for diagnostic and therapeutic purposes.

This application is a divisional of application Ser. No. 08/221,653, filed Apr. 1, 1994 which is a continuation of application Ser. No. 07/881,528, filed May 8, 1992, now abandoned which is a continuation-in-part of 07/697,326 filed May 8, 1991 and now abandoned.

TECHNICAL FIELD

The invention relates to compositions and methods for the detection and treatment of hepatitis C virus, (HCV) infection, formerly referred to as blood-borne non-A, non-B hepatitis virus (NANBV) infection. More specifically, embodiments of the present invention feature compositions and methods for the detection of HCV, and for the development of vaccines for the prophylactic treatment of infections of HCV, and development of antibody products for conveying passive immunity to HCV.

BACKGROUND OF THE INVENTION

The prototype isolate of HCV was characterized in U.S. patent application Ser. No. 122,714 (See also EPO Publication No. 318,216). As used herein, the term "HCV" includes new isolates of the same viral species. The term "HCV-1" referred to in U.S. patent application Ser. No. 122,714.

HCV is a transmissible disease distinguishable from other forms of viral-associated liver diseases, including that caused by the known hepatitis viruses, i.e., hepatitis A virus (HAV), hepatitis B virus (HBV), and delta hepatitis virus (HDV), as well as the hepatitis induced by cytomegalovirus (CMV) or Epstein-Barr virus (EBV). HCV was first identified in individuals who had received blood transfusions.

The demand for sensitive, specific methods for screening and identifying carriers of HCV and HCV contaminated blood or blood products is significant. Post-transfusion hepatitis (PTH) occurs in approximately 10% of transfused patients, and HCV accounts for up to 90% of these cases. The disease frequently progresses to chronic liver damage (25-55%).

Patient care as well as the prevention of transmission of HCV by blood and blood products or by close personal contact require reliable screening, diagnostic and prognostic tools to detect nucleic acids, antigens and antibodies related to HCV.

Information in this application suggests the HCV has several genotypes. That is, the genetic information of the HCV virus may not be totally identical for all HCV, but encompasses groups with differing genetic information.

Genetic information is stored in thread-like molecules of DNA and RNA. DNA consists of covalently linked chains of deoxyribonucleotides and RNA consists of covalently linked chains of ribonucleotides. Each nucleotide is characterized by one of four bases: adenine (A), guanine (G), thymine (T), and cytosine (C). The bases are complementary in the sense that, due to the orientation of functional groups, certain base pairs attract and bond to each other through hydrogen bonding and π-stacking interactions. Adenine in one strand of DNA pairs with thymine in an opposing complementary strand. Guanine in one strand of DNA pairs with cytosine in an opposing complementary strand. In RNA, the thymine base is replaced by uracil (U) which pairs with adenine in an opposing complementary strand. The genetic code of living organism is carried in the sequence of base pairs. Living cells interpret, transcribe and translate the information of nucleic acid to make proteins and peptides.

The HCV genome is comprised of a single positive strand of RNA. The HCV genome possesses a continuous, translational open reading frame (ORF) that encodes a polyprotein of about 3,000 amino acids. In the ORF, the structural protein(s) appear to be encoded in approximately the first quarter of the N-terminus region, with the majority of the polyprotein responsible for non-structural proteins.

The HCV polyprotein comprises, from the amino terminus to the carboxy terminus, the nucleocapsid protein (C), the envelope protein (E), and the non-structural proteins (NS) 1, 2 (b), 3, 4 (b), and 5.

HCV of differing genotypes may encode for proteins which present an altered response to host immune systems. HCV of differing genotypes may be difficult to detect by immuno diagnostic techniques and nucleic acid probe techniques which are not specifically directed to such genotype.

Definitions for selected terms used in the application are set forth below to facilitate an understanding of the invention. The term "corresponding" means homologous to or complementary to a particular sequence of nucleic acid. As between nucleic acids and peptides, corresponding refers to amino acids of a peptide in an order derived from the sequence of a nucleic acid or its complement.

The term "non-naturally occurring nucleic acid" refers to a portion of genomic nucleic acid, cDNA, semisynthetic nucleic acid, or synthetic origin nucleic acid which, by virtue of its origin or manipulation: (1) is not associated with all of a nucleic acid with which it is associated in nature, (2) is linked to a nucleic acid or other chemical agent other than that to which it is linked in nature, or (3) does not occur in nature.

Similarly the term, "a non-naturally occurring peptide" refers to a portion of a large naturally occurring peptide or protein, or semi-synthetic or synthetic peptide, which by virtue of its origin or manipulation (1) is not associated with all of a peptide with which it is associated in nature, (2) is linked to peptides, functional groups or chemical agents other than that to which it is linked in nature, or (3) does not occur in nature.

The term "primer" refers to a nucleic acid which is capable of initiating the synthesis of a larger nucleic acid when placed under appropriate conditions. The primer will be completely or substantially complementary to a region of the nucleic acid to be copied. Thus, under conditions conducive to hybridization, the primer will anneal to a complementary region of a larger nucleic acid. Upon addition of suitable reactants, the primer is extended by the polymerizing agent to form a copy of the larger nucleic acid.

The term "binding pair" refers to any pair of molecules which exhibit mutual affinity or binding capacity. For the purposes of the present application, the term "ligand" will refer to one molecule of the binding pair, and the term "antiligand" or "receptor" or "target" will refer to the opposite molecule of the binding pair. For example, with respect to nucleic acids, a binding pair may comprise two complementary nucleic acids. One of the nucleic acids may be designated the ligand and the other strand is designated the antiligand receptor or target. The designation of ligand or antiligand is a matter of arbitrary convenience. Other binding pairs comprise, by way of example, antigens and antibodies, drugs and drug receptor sites and enzymes and enzyme substrates, to name a few.

The term "label" refers to a molecular moiety capable of detection including, by way of example, without limitation, radioactive isotopes, enzymes, luminescent agents, precipitating agents, and dyes.

The term "support" includes conventional supports such as filters and membranes as well as retrievable supports which can be substantially dispersed within a medium and removed or separated from the medium by immobilization, filtering, partitioning, or the like. The term "support means" refers to supports capable of being associated to nucleic acids, peptides or antibodies by binding partners, or covalent or noncovalent linkages.

A number of HCV strains and isolates have been identified. When compared with the sequence of the original isolate derived from the USA ("HCV-1"; see Q.-L. Choo et al. (1989) Science 244:359-362, Q.-L. Choo et al. (1990) Brit. Med. Bull. 46:423-441, Q.-L. Choo et al., Proc. Natl. Acad. Sci. 88:2451-2455 (1991), and E.P.O. Patent Publication No. 318,216, cited supra), it was found that a Japanese isolate ("HCV J1") differed significantly in both nucleotide and polypeptide sequence within the NS3 and NS4 regions. This conclusion was later extended to the NS5 and envelope (E1/S and E2/NS1) regions (see K. Takeuchi et al., J. Gen. Virol. (1990) 71:3027-3033, Y. Kubo, Nucl. Acids. Res. (1989) 17:10367-10372, and K. Takeuchi et al., Gene (1990) 91:287-291). The former group of isolates, originally identified in the United States, is termed "Genotype I" throughout the present disclosure, while the latter group of isolates, initially identified in Japan, is termed "Genotype II" herein.

BRIEF DESCRIPTION OF THE INVENTION

The present invention features compositions of matter comprising nucleic acids and peptides corresponding to the HCV viral genome which define different genotypes. The present invention also features methods of using the compositions corresponding to sequences of the HCV viral genome which define different genotypes described herein.

A. Nucleic acid compositions

The nucleic acid of the present invention, corresponding to the HCV viral genome which define different genotypes, have utility as probes in nucleic acid hybridization assays, as primers for reactions involving the synthesis of nucleic acid, as binding partners for separating HCV viral nucleic acid from other constituents which may be present, and as anti-sense nucleic acid for preventing the transcription or translation of viral nucleic acid.

One embodiment of the present invention features a composition comprising a non-naturally occurring nucleic acid having a nucleic acid sequence of at least eight nucleotides corresponding to a non-HCV-1 nucleotide sequence of the hepatitis C viral genome. Preferably, the nucleotide sequence is selected from a sequence present in at least one region consisting of the NS5 region, envelope 1 region, 5'UT region, and the core region.

Preferably, with respect to sequences which correspond to the NS5 region, the sequence is selected from a sequence within a sequence numbered 2-22. The sequence numbered 1 corresponds to HCV-1. Sequences numbered 1-22 are defined in the Sequence Listing of the application.

Preferably, with respect to sequences corresponding to the envelope 1 region, the sequence is selected from a sequence within sequences numbered 24-32. Sequence No. 23 corresponds to HCV-1. Sequences numbered 23-32 are set forth in the Sequence Listing of the application.

Preferably, with respect to the sequences which correspond to the 5'UT regions, the sequence is selected from a sequence within sequences numbered 34-51. Sequence No. 33 corresponds to HCV-1. Sequence No. 33-51 are set forth in the Sequence Listing of this application.

Preferably, with respect to the sequences which correspond to the core region, the sequence is selected from a sequence within the sequences numbered 53-66. Sequence No. 52 corresponds to HCV-1. Sequences 52-66 are set forth in the Sequence Listing of this application.

The compositions of the present invention form hybridization products with nucleic acid corresponding to different genotypes of HCV.

HCV has at least five genotypes, which will be referred to in this application by the designations GI-GV. The first genotype, GI, is exemplified by sequences numbered 1-6, 23-25, 33-38 and 52-57. The second genotype, GII, is exemplified by the sequences numbered 7-12, 26-28, 39-45 and 58-64. The third genotype, GIII, is exemplified by sequences numbered 13-17, 32, 46-47 and 65-66. The fourth genotype, GIV, is exemplified by sequences numbered 20-22, and 29-31 and 48-49. The fifth genotype, GV, is exemplified by sequences numbered 18, 19, 50 and 51.

One embodiment of the present invention features compositions comprising a nucleic acid having a sequence corresponding to one or more sequences which exemplify a genotype of HCV.

B. Method of forming a Hybridization Product

Embodiments of the present invention also feature a method of forming a hybridization product with nucleic acid having a sequence corresponding to HCV nucleic acid. One method comprises the steps of placing a non-naturally occurring nucleic acid having a non-HCV-1 sequence corresponding to HCV nucleic acid under conditions in which hybridization may occur. The non-naturally occurring nucleic acid is capable of forming a hybridization product with HCV nucleic acid, under hybridization conditions. The method further comprises the step of imposing hybridization conditions to form a hybridization product in the presence of nucleic acid corresponding to a region of the HCV genome.

The formation of a hybridization product has utility for detecting the presence of one or more genotypes of HCV. Preferably, the non-naturally occurring nucleic acid forms a hybridization product with nucleic acid of HCV in one or more regions comprising the NS5 region, envelope 1 region, 5'UT region and the core region. To detect the hybridization product, it is useful to associate the non-naturally occurring nucleic acid with a label. The formation of the hybridization product is detected by separating the hybridization product from labeled non-naturally occurring nucleic acid, which has not formed a hybridization product.

The formation of a hybridization product has utility as a means of separating one or more genotypes of HCV nucleic acid from other constituents potentially present. For such applications, it is useful to associate the non-naturally occurring nucleic acid with a support for separating the resultant hybridization product from the the other constituents.

Nucleic acid "sandwich assays" employ one nucleic acid associated with a label and a second nucleic acid associated with a support. An embodiment of the present invention features a sandwich assay comprising two nucleic acids, both have sequences which correspond to HCV nucleic acids; however, at least one non-naturally occurring nucleic acid has a sequence corresponding to non-HCV-1 HCV nucleic acid. At least one nucleic acid is capable of associating with a label, and the other is capable of associating with a support. The support associated non-naturally occurring nucleic acid is used to separate the hybridization products which include an HCV nucleic acid and the non-naturally occurring nucleic acid having a non-HCV-1 sequence.

One embodiment of the present invention features a method of detecting one or more genotypes of HCV. The method comprises the steps of placing a non-naturally occurring nucleic acid under conditions which hybridization may occur. The non-naturally occurring nucleic acid is capable of forming a hybridization product with nucleic acid from one or more genotypes of HCV. The first genotype, GI, is exemplified by sequences numbered 1-6, 23-25, 33-38 and 52-57. The second genotype, GII, is exemplified by the sequences numbered 7-12, 26-28, 39-45 and 58-64. The third genotype, GIII, is exemplified by sequences numbered 13-17, 32, 46-47 and 65-66. The fourth genotype, GIV, is exemplified sequences numbered 20-22 and 29-31. The fifth genotype, GV, is exemplified by sequences numbered 18, 19, 50 and 51.

The hybridization product of HCV nucleic acid with a non-naturally occurring nucleic acid having non-HCV-1 sequence corresponding to sequences within the HCV genome has utility for priming a reaction for the synthesis of nucleic acid.

The hybridization product of HCV nucleic acid with a non-naturally occurring nucleic acid having a sequence corresponding to a particular genotype of HCV has utility for priming a reaction for the synthesis of nucleic acid of such genotype. In one embodiment, the synthesized nucleic acid is indicative of the presence of one or more genotypes of HCV.

The synthesis of nucleic acid may also facilitate cloning of the nucleic acid into expression vectors which synthesize viral proteins.

Embodiments of the present methods have utility as anti-sense agents for preventing the transcription or translation of viral nucleic acid. The formation of a hybridization product of a non-naturally occurring nucleic acid having sequences which correspond to a particular genotype of HCV genomic sequencing with HCV nucleic acid may block translation or transcription of such genotype. Therapeutic agents can be engineered to include all five genotypes for inclusivity.

C. Peptide and antibody composition

A further embodiment of the present invention features a composition of matter comprising a non-naturally occurring peptide of three or more amino acids corresponding to a nucleic acid having a non-HCV-1 sequence. Preferably, the non-HCV-1 sequence corresponds with a sequence within one or more regions consisting of the NS5 region, the envelope 1 region, the 5'UT region, and the core region.

Preferably, with respect to peptides corresponding to a nucleic acid having a non-HCV-1 sequence of the NS5 region, the sequence is within sequences numbered 2-22. The sequence numbered 1 corresponds to HCV-1. Sequences numbered 1-22 are set forth in the Sequence Listing.

Preferably, with respect to peptides corresponding to a nucleic acid having a non-HCV-1 sequence of the envelope 1 region, the sequence is within sequences numbered 24-32. The sequence numbered 23 corresponds to HCV-1. Sequences numbered 23-32 are set forth in the Sequence Listing.

Preferably, with respect to peptides corresponding to a nucleic acid having a non-HCV-1 sequence directed to the core region, the sequence is within sequences numbered 53-66. Sequence numbered 52 corresponds to HCV-1. Sequences numbered 52-66 are set forth in the Sequence Listing.

The further embodiment of the present invention features peptide compositions corresponding to nucleic acid sequences of a genotype of HCV. The first genotype, GI, is exemplified by sequences numbered 1-6, 23-25, 33-38 and 52-57. The second genotype, GII, is exemplified by the sequences numbered 7-12, 26-28, 39-45 and 58-64. The third genotype, GIII, is exemplified by sequences numbered 13-17, 32, 46-47 and 65-66. The fourth genotype, GIV, is exemplified sequences numbered 20-22, 29-31, 48 and 49. The fifth genotype, GV, is exemplified by sequences numbered 18, 19, 50 and 51.

The non-naturally occurring peptides of the present invention are useful as a component of a vaccine. The sequence information of the present invention permits the design of vaccines which are inclusive for all or some of the different genotypes of HCV. Directing a vaccine to a particular genotype allows prophylactic treatment to be tailored to maximize the protection to those agents likely to be encountered. Directing a vaccine to more than one genotype allows the vaccine to be more inclusive.

The peptide compositions are also useful for the development of specific antibodies to the HCV proteins. One embodiment of the present invention features as a composition of matter, an antibody to peptides corresponding to a non-HCV-1 sequence of the HCV genome. Preferably, the non-HCV-1 sequence is selected from the sequence within a region consisting of the NS5 region, the envelope 1 region, and the core region. There are no peptides associated with the untranslated 5'UT region.

Preferably, with respect to antibodies directed to peptides of the NS5 region, the peptide corresponds to a sequence within sequences numbered 2-22. Preferably, with respect to antibodies directed to a peptide corresponding to the envelope 1 region, the peptide corresponds to a sequence within sequences numbered 24-32. Preferably, with respect to the antibodies directed to peptides corresponding to the core region, the peptide corresponds to a sequence within sequences numbered 53-66.

Antibodies directed to peptides which reflect a particular genotype have utility for the detection of such genotypes of HCV and therapeutic agents.

One embodiment of the present invention features an antibody directed to a peptide corresponding to nucleic acid having sequences of a particular genotype. The first genotype, GI, is exemplified by sequences numbered 1-6, 23-25, 33-38 and 52-57. The second genotype, GII, is exemplified by the sequences numbered 7-12, 26-28, 39-45 and 58-64. The third genotype, GIII, is exemplified by sequences numbered 13-17, 32, 46-47 and 65-66. The fourth genotype, GIV, is exemplified sequences numbered 20-22, 29-31, 48 and 49. The fifth genotype, GV, is exemplified by sequences numbered 18, 19, 50 and 51.

Individuals skilled in the art will readily recognize that the compositions of the present invention can be packaged with instructions for use in the form of a kit for performing nucleic acid hybridizations or immunochemical reactions.

The present invention is further described in the following figures which illustrate sequences demonstrating genotypes of HCV. The sequences are designated by numerals 1-145, which numerals and sequences are consistent with the numerals and sequences set forth in the Sequence Listing. Sequences 146 and 147 facilitate the discussion of an assay which numerals and sequences are consistent with the numerals and sequences set forth in the Sequence Listing.

BRIEF DESCRIPTION OF THE FIGURES AND SEQUENCE LISTING

FIG. 1 depicts schematically the genetic organization of HCV;

FIG. 2 sets forth nucleic acid sequences numbered 1-22 which sequences are derived from the NS5 region of the HCV viral genome;

FIG. 3 sets forth nucleic acid sequences numbered 23-32 which sequences are derived from the envelope 1 region of the HCV viral genome;

FIG. 4 sets forth nucleic acid sequences numbered 33-51 which sequences are derived from the 5'UT region of the HCV viral genome; and,

FIG. 5 sets forth nucleic acid sequences numbered 52-66 which sequences are derived from the core region of the HCV viral genome.

The Sequence Listing sets forth the sequences of sequences numbered 1-147.

DETAILED DESCRIPTION OF THE INVENTION

The present invention will be described in detail as as nucleic acid having sequences corresponding to the HCV genome and related peptides and binding partners, for diagnostic and therapeutic applications.

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of chemistry, molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See e.g., Maniatis, Fitsch & Sambrook, Molecular Cloning; A Laboratory Manual (1982); DNA Cloning, Volumes I and II (D. N Glover ed. 1985); Oligonucleotide Synthesis (M. J. Gait ed, 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); the series, Methods in Enzymology (Academic Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and Grossman, eds.).

The cDNA libraries are derived from nucleic acid sequences present in the plasma of an HCV-infected chimpanzee. The construction of one of these libraries, the "c" library (ATCC No. 40394), is described in PCT Pub. No. WO90/14436. The sequences of the library relevant to the present invention are set forth herein as sequence numbers 1, 23, 33 and 52.

Nucleic acids isolated or synthesized in accordance with features of the present invention are useful, by way of example without limitation as probes, primers, anti-sense genes and for developing expression systems for the synthesis of peptides corresponding to such sequences.

The nucleic acid sequences described define genotypes of HCV with respect to four regions of the viral genome. FIG. 1 depicts schematically the organization of HCV. The four regions of particular interest are the NS5 region, the envelope 1 region, the 5'UT region and the core region.

The sequences set forth in the present application as sequences numbered 1-22 suggest at least five genotypes in the NS5 region. Sequences numbered 1-22 are depicted in FIG. 2 as well as the Sequence Listing. Each sequence numbered 1-22 is derived from nucleic acid having 340 nucleotides from the NS5 region.

The five genotypes are defined by groupings of the sequences defined by sequence numbered 1-22. For convenience, in the present application, the different genotypes will be assigned roman numerals and the letter "G".

The first genotype (GI) is exemplified by sequences within sequences numbered 1-6. A second genotype (GII) is exemplified by sequences within sequences numbered 7-12. A third genotype (GIII) is exemplified by the sequences within sequences numbered 13-17. A fourth genotype (GIV) is exemplified by sequences within sequences numbered 20-22. A fifth genotype (GV) is exemplified by sequences within sequences numbered 18 and 19.

The sequences set forth in the present application as sequences numbered 23-32 suggest at least four genotypes in the envelope 1 region of HCV. Sequences numbered 23-32 are depicted in FIG. 3 as well as in the Sequence Listing. Each sequence numbered 23-32 is derived from nucleic acid having 100 nucleotides from the envelope 1 region.

A first envelope 1 genotype group (GI) is exemplified by the sequences within the sequences numbered 23-25. A second envelope 1 genotype (GII) region is exemplified by sequences within sequences numbered 26-28. A third envelope 1 genotype (GIII) is exemplified by the sequences within sequences numbered 32. A fourth envelope 1 genotype (GIV) is exemplified by the sequences within sequence numbered 29-31.

The sequences set forth in the present application as sequences numbered 33-51 suggest at least three genotypes in the 5'UT region of HCV. Sequences numbered 33-51 are depicted in FIG. 4 as well as in the Sequence Listing. Each sequence numbered 33-51 is derived from the nucleic acid having 252 nucleotides from the 5'UT region, although sequences 50 and 51 are somewhat shorter at approximately 180 nucleotides.

The first 5'UT genotype (GI) is exemplified by the sequences within sequences numbered 33-38. A second 5'UT genotype (GII) is exemplified by the sequences within sequences numbered 39-45. A third 5'UT genotype (GIII) is exemplified by the sequences within sequences numbered 46-47. A fourth 5'UT genotype (GIV) is exemplified by sequences within sequences humbered 48 and 49. A fifth 5'UT genotype (GV) is exemplified by sequences within sequences numbered 50 and 51.

The sequences numbered 48-62 suggest at least three genotypes in the core region of HCV. The sequences numbered 52-66 are depicted in FIG. 5 as well as in the Sequence Listing.

The first core region genotype (GI) is exemplified by the sequences within sequences numbered 52-57. The second core region genotype (GII) is exemplified by sequences within sequences numbered 58-64. The third core region genotype (GIII) is exemplified by sequences within sequences numbered 65 and 66. Sequences numbered 52-65 are comprised of 549 nucleotides. Sequence numbered 66 is comprised of 510 nucleotides.

The various genotypes described with respect to each region are consistent. That is, HCV having features of the first genotype with respect to the NS5 region will substantially conform to features of the first genotype of the envelope 1 region, the 5'UT region and the core region.

Nucleic acid isolated or synthesized in accordance with the sequences set forth in sequence numbers 1-66 are useful as probes, primers, capture ligands and anti-sense agents. As probes, primers, capture ligands and anti-sense agents, the nucleic acid wil normally comprise approximately eight or more nucleotides for specificity as well as the ability to form stable hybridization products.

Probes

A nucleic acid isolated or synthesized in accordance with a sequence defining a particular genotype of a region of the HCV genome can be used as a probe to detect such genotype or used in combination with other nucleic acid probes to detect substantially all genotypes of HCV.

With the sequence information set forth in the present application, sequences of eight or more nucleotides are identified which provide the desired inclusivity and exclusivity with respect to various genotypes within HCV, and extraneous nucleic acid sequences likely to be encountered during hybridization conditions.

Individuals skilled in the art will readily recognize that the nucleic acid sequences, for use as probes, can be provided with a label to facilitate detection of a hybridization product.

Capture Ligand

For use as a capture ligand, the nucleic acid selected in the manner described above with respect to probes, can be readily associated with supports. The manner in which nucleic acid is associated with supports is well known. Nucleic acid having sequences corresponding to a sequence within sequences numbered 1-66 have utility to separate viral nucleic acid of one genotype from the nucleic acid of HCV of a different genotype. Nucleic acid isolated or synthesized in accordance with sequences within sequences numbered 1-66, used in combinations, have utility to capture substantially all nucleic acid of all HCV genotypes.

Primers

Nucleic acid isolated or synthesized in accordance with the sequences described herein have utility as primers for the amplification of HCV sequences. With respect to polymerase chain reaction (PCR) techniques, nucleic acid sequences of eight or more nucleotides corresponding to one or more sequences of sequences numbered 1-66 have utility in conjunction with suitable enzymes and reagents to create copies of the viral nucleic acid. A plurality of primers having different sequences corresponding to more than one genotype can be used to create copies of viral nucleic acid for such genotypes.

The copies can be used in diagnostic assays to detect HCV virus. The copies can also be incorporated into cloning and expression vectors to generate polypeptides corresponding to the nucleic acid synthesized by PCR, as will be described in greater detail below.

Anti-sense

Nucleic acid isolated or synthesized in accordance with the sequences described herein have utility as anti-sense genes to prevent the expression of HCV.

Nucleic acid corresponding to a genotype of HCV is loaded into a suitable carrier such as a liposome for introduction into a cell infected with HCV. A nucleic acid having eight or more nucleotides is capable of binding to viral nucleic acid or viral messenger RNA. Preferably, the anti-sense nucleic acid is comprised of 30 or more nucleotides to provide necessary stability of a hybridization product of viral nucleic acid or viral messenger RNA. Methods for loading anti-sense nucleic acid is known in the art as exemplified by U.S. Pat. No. 4,241,046 issued Dec. 23, 1980 to Papahadjopoulos et al.

Peptide Synthesis

Nucleic acid isolated or synthesized in accordance with the sequences described herein have utility to generate peptides. The sequences exemplified by sequences numbered 1-32 and 52-66 can be cloned into suitable vectors or used to isolate nucleic acid. The isolated nucleic acid is combined with suitable DNA linkers and cloned into a suitable vector. The vector can be used to transform a suitable host organism such as E. coli and the peptide encoded by the sequences isolated.

Molecular cloning techniques are described in the text Molecular Cloning: A Laboratory Manual, Maniatis et al., Coldspring Harbor Laboratory (1982).

The isolated peptide has utility as an antigenic substance for the development of vaccines and antibodies directed to the particular genotype of HCV.

Vaccines and Antibodies

The peptide materials of the present invention have utility for the development of antibodies and vaccines.

The availability of cDNA sequences, or nucleotide sequences derived therefrom (including segments and modifications of the sequence), permits the construction of expression vectors encoding antigenically active regions of the peptide encoded in either strand. The antigenically active regions may be derived from the NS5 region, envelope 1 regions, and the core region.

Fragments encoding the desired peptides are derived from the cDNA clones using conventional restriction digestion or by synthetic methods, and are ligated into vectors which may, for example, contain portions of fusion sequences such as beta galactosidase or superoxide dismutase (SOD), preferably SOD. Methods and vectors which are useful for the production of polypeptides which contain fusion sequences of SOD are described in European Patent Office Publication number 0196056, published Oct. 1, 1986.

Any desired portion of the HCV cDNA containing an open reading frame, in either sense strand, can be obtained as a recombinant peptide, such as a mature or fusion protein; alternatively, a peptide encoded in the cDNA can be provided by chemical synthesis.

The DNA encoding the desired peptide, whether in fused or mature form, and whether or not containing a signal sequence to permit secretion, may be ligated into expression vectors suitable for any convenient host. Both eukaryotic and prokaryotic host systems are presently used in forming recombinant peptides. The peptide is then isolated from lysed cells or from the culture medium and purified to the extent needed for its intended use. Purification may be by techniques known in the art, for example, differential extraction, salt fractionation, chromatography on ion exchange resins, affinity chromatography, centrifugation, and the like. See, for example, Methods in Enzymology for a variety of methods for purifying proteins. Such peptides can be used as diagnostics, or those which give rise to neutralizing antibodies may be formulated into vaccines. Antibodies raised against these peptides can also be used as diagnostics, or for passive immunotherapy or for isolating and identifying HCV.

An antigenic region of a peptide is generally relatively small--typically 8 to 10 amino acids or less in length. Fragments of as few as 5 amino acids may characterize an antigenic region. These segments may correspond to NS5 region, envelope 1 region, and the core region of the HCV genome. The 5'UT region is not known to be translated. Accordingly, using the cDNAs of such regions, DNAs encoding short segments of HCV peptides corresponding to such regions can be expressed recombinantly either as fusion proteins, or as isolated peptides. In addition, short amino acid sequences can be conveniently obtained by chemical synthesis. In instances wherein the synthesized peptide is correctly configured so as to provide the correct epitope, but is too small to be immunogenic, the peptide may be linked to a suitable carrier.

A number of techniques for obtaining such linkage are known in the art, including the formation of disulfide linkages using N-succinimidyl-3-(2-pyridylthio)propionate (SPDP) and succinimidyl 4-(N-maleimido-methyl)cyclohexane-1-carboxylate (SMCC) obtained from Pierce Company, Rockford, Ill., (if the peptide lacks a sulfhydryl group, this can be provided by addition of a cysteine residue). These reagents create a disulfide linkage between themselves and peptide cysteine residues on one protein and an amide linkage through the epsilon-amino on a lysine, or other free amino group in the other. A variety of such disulfide/amide-forming agents are known. See, for example, Immun Rev (1982) 62:185. Other bifunctional coupling agents form a thioether rather than a disulfide linkage. Many of these thio-ether-forming agents are commercially available and include reactive esters of 6-maleimidocaprioc acid, 2-bromoacetic acid, 2-iodoacetic acid, 4-N-maleimido-methyl)cyclohexane-1-carboxylic acid, and the like. The carboxyl groups can be activated by combining them with succinimide or 1-hydroxyl-2 nitro-4-sulfonic acid, sodium salt. Additional methods of coupling antigens employs the rotavirus/"binding peptide" system described in EPO Pub. No. 259,149, the disclosure of which is incorporated herein by reference. The foregoing list is not meant to be exhaustive, and modifications of the named compounds can clearly be used.

Any carrier may be used which does not itself induce the production of antibodies harmful to the host. Suitable carriers are typically large, slowly metabolized macromolecules such as proteins; polysaccharides, such as latex functionalized Sepharose, agarose, cellulose, cellulose beads and the like; polymeric amino acids, such as polyglutamic acid, polylysine, and the like; amino acid copolymers; and inactive virus particles. Especially useful protein substrates are serum albumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, and other proteins well known to those skilled in the art.

Peptides comprising HCV amino acid sequences encoding at least one viral epitope derived from the NS5, envelope 1, and core region are useful immunological reagents. The 5'UT region is not known to be translated. For example, peptides comprising such truncated sequences can be used as reagents in an immunoassay. These peptides also are candidate subunit antigens in compositions for antiserum production or vaccines. While the truncated sequences can be produced by various known treatments of native viral protein, it is generally preferred to make synthetic or recombinant peptides comprising HCV sequence. Peptides comprising these truncated HCV sequences can be made up entirely of HCV sequences (one or more epitopes, either contiguous or noncontiguous), or HCV sequences and heterologous sequences in a fusion protein. Useful heterologous sequences include sequences that provide for secretion from a recombinant host, enhance the immunological reactivity of the HCV epitope(s), or facilitate the coupling of the polypeptide to an immunoassay support or a vaccine carrier. See, E.G., EPO Pub. No. 116,201; U.S. Pat. No. 4,722,840; EPO Pub. No. 259,149; U.S. Pat. No. 4,629,783.

The size of peptides comprising the truncated HCV sequences can vary widely, the minimum size being a sequence of sufficient size to provide an HCV epitope, while the maximum size is not critical. For convenience, the maximum size usually is not substantially greater than that required to provide the desired HCV epitopes and function(s) of the heterologous sequence, if any. Typically, the truncated HCV amino acid sequence will range from about 5 to about 100 amino acids in length. More typically, however, the HCV sequence will be a maximum of about 50 amino acids in length, preferably a maximum of about 30 amino acids. It is usually desirable to select HCV sequences of at least about 10, 12 or 15 amino acids, up to a maximum of about 20 or 25 amino acids.

HCV amino acid sequences comprising epitopes can be identified in a number of ways. For example, the entire protein sequence corresponding to each of the NS5, envelope 1, and core regions can be screened by preparing a series of short peptides that together span the entire protein sequence of such regions. By starting with, for example, peptides of approximately 100 amino acids, it would be routine to test each peptide for the presence of epitope(s) showing a desired reactivity, and then testing progressively smaller and overlapping fragments from an identified peptides of 100 amino acids to map the epitope of interest. Screening such peptides in an immunoassay is within the skill of the art. It is also known to carry out a computer analysis of a protein sequence to identify potential epitopes, and then prepare peptides comprising the identified regions for screening.

The immunogenicity of the epitopes of HCV may also be enhanced by preparing them in mammalian or yeast systems fused with or assembled with particle-forming proteins such as, for example, that associated with hepatitis B surface antigen. See, e.g., U.S. Pat. No. 4,722,840. Constructs wherein the HCV epitope is linked directly to the particle-forming protein coding sequences produce hybrids which are immunogenic with respect to the HCV epitope. In addition, all of the vectors prepared include epitopes specific to HBV, having various degrees of immunogenicity, such as, for example, the pre-S peptide. Thus, particles constructed from particle forming protein which include HCV sequences are immunogenic with respect to HCV and HBV.

Hepatitis surface antigen (HBSAg) has been shown to be formed and assembled into particles in S. cerevisiae (P. Valenzuela et al. (1982)), as well as in, for example, mammalian cells (P. Valenzuela et al. 1984)). The formation of such particles has been shown to enhance the immunogenicity of the monomer subunit. The constructs may also include the immunodominant epitope of HBSAg, comprising the 55 amino acids of the presurface (pre-S) region. Neurath et al. (1984). Constructs of the pre-S-HBSAg particle expressible in yeast are disclosed in EPO 174,444, published Mar. 19, 1986; hybrids including heterologous viral sequences for yeast expression are disclosed in EPO 175,261, published Mar. 26, 1966. These constructs may also be expressed in mammalian cells such as Chinese hamster ovary (CHO) cells using an SV40-dihydrofolate reductase vector (Michelle et al. (1984)).

In addition, portions of the particle-forming protein coding sequence may be replaced with codons encoding an HCV epitope. In this replacement, regions which are not required to mediate the aggregation of the units to form immunogenic particles in yeast of mammals can be deleted, thus eliminating additional HBV antigenic sites from competition with the HCV epitope.

Vaccines

Vaccines may be prepared from one or more immunogenic peptides derived from HCV. The observed homology between HCV and Flaviviruses provides information concerning the peptides which are likely to be most effective as vaccines, as well as the regions of the genome in which they are encoded.

Multivalent vaccines against HCV may be comprised of one or more epitopes from one or more proteins derived from the NS5, envelope 1, and core regions. In particular, vaccines are contemplated comprising one or more HCV proteins or subunit antigens derived from the NS5, envelope 1, and core regions. The 5'UT region is not known to be translated.

The preparation of vaccines which contain an immunogenic peptide as an active ingredient, is known to one skilled in the art. Typically, such vaccines are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The preparation may also be emulsified, or the protein encapsulated in liposomes. The active immunogenic ingredients are often mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants which enhance the effectiveness of the vaccine. Examples of adjuvants which may be effective include but are not limited to: aluminum hydroxide, N-acetyl-muramyl-L-theronyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1-2-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE), and RIBI, which contains three components extracted from bacteria, monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be determined by measuring the amount of antibodies directed against an immunogenic peptide containing an HCV antigenic sequence resulting from administration of this peptide in vaccines which are also comprised of the various adjuvants.

The vaccines are conventionally administered parenterally, by injection, for example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other modes of administration include suppositories and, in some cases, oral formulations. For suppositories, traditional binders and carriers may include, for example, polyalkylene glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in the range of 0/5% to 10%, preferably 1%-2%. Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like.

The examples below are provided for illustrative purposes and are not intended to limit the scope of the present invention.

I. Detection of HCV RNA from Serum

RNA was extracted from serum using guanidinium salt, phenol and chloroform according to the instructions of the kit manufacturer (RNAzol™ B kit, Cinna/Biotecx). Extracted RNA was precipitated with isopropanol and washed with ethanol. A total of 25 μl serum was processed for RNA isolation, and the purified RNA was resuspended in 5 μl diethyl pyrocarbonate treated water for subsequent cDNA synthesis.

II. cDNA Synthesis and Polymerase Chain Reaction (PCR) Amplification

Table 1 lists the sequence and position (with reference to HCV1) of all the PCR primers and probes used in these examples. Letter designations for nucleotides are consistent with 37 C.F.R. §1.821-1.825. Thus, the letters A, C, G, T, and U are used in the ordinary sense of adenine, cytosine, guanine, thymine, and uracil. The letter M means A or C; R means A or G; W means A or T/U; S means C or G; Y means C or T/U; K means G or T/U; V means A or C or G, not T/U; H means A or C or T/U, not G; D means A or G or T/U, not C; B means C or G or T/U, not A; N means (A or C or G or T/U) or (unknown or other). Table 1 is set forth below:

                  TABLE 1                                                          ______________________________________                                                                       Nucleotide                                       Seq. No.                                                                              Sequence (5'-3')       Position                                         ______________________________________                                         67     CAAACGTAACACCAACCGRCGCCCACAGG                                                                         374-402                                          68     ACAGAYCCGCAKAGRTCCCCCACG                                                                              1192-1169                                        69     GCAACCTCGAGGTAGACGTCAGCCTATCCC                                                                        509-538                                          70     GCAACCTCGTGGAAGGCGACAACCTATCCC                                                                        509-538                                          71     GTCACCAATGATTGCCCTAACTCGAGTATT                                                                        948-977                                          72     GTCACGAACGACTGCTCCAACTCAAG                                                                            948-973                                          73     TGGACATGATCGCTGGWGCYCACTGGGG                                                                          1375-1402                                        74     TGGAYATGGTGGYGGGGGCYCACTGGGG                                                                          1375-1402                                        75     ATGATGAACTGGTCVCCYAC   1308-1327                                        76     ACCTTVGCCCAGTTSCCCRCCATGGA                                                                            1453-1428                                        77     AACCCACTCTATGYCCGGYCAT 205-226                                          78     GAATCGCTGGGGTGACCG     171-188                                          79     CCATGAATCACTCCCCTGTGAGGAACTA                                                                          30-57                                            80     TTGCGGGGGCACGCCCAA     244-227                                          ______________________________________                                    

For cDNA synthesis and PCR amplification, a protocol developed by Perkin-Elmer/Cetus (GeneAmp® RNA PCR kit) was used. Both random hexamer and primers with specific complementary sequences to HCV were employed to prime the reverse transcription (RT) reaction. All processes, except for adding and mixing reaction components, were performed in a thermal cycler (MJ Research, Inc.). The first strand cDNA synthesis reaction was inactivated at 99° C. for 5 min, and then cooled at 50° C. for 5 min before adding reaction components for subsequent amplification. After an initial 5 cycles of 97° C. for 1 min, 50° C. for 2 min, and 72° C. for 3 min, 30 cycles of 94° C. for 1 min, 55° C. for 2 min, and 72° C. for 3 min followed, and then a final 7 min of elongation at 72° C.

For the genotyping analysis, sequences 67 and 68 were used as primers in the PCR reaction. These primers amplify a segment corresponding to the core and envelope regions. After amplification, the reaction products were separated on an agarose gel and then transferred to a nylon membrane. The immobilized reaction products were allowed to hybridize with a ³² P-labelled nucleic acid corresponding to either Genotype I (core or envelope 1) or Genotype II (core or envelope 1). Nucleic acid corresponding to Genotype 1 comprised sequences numbered 69 (core), 71 (envelope), and 73 (envelope). Nucleic acid corresponding to Genotype II comprised sequences numbered 70 (core), 72 (envelope), and 74 (envelope).

The Genotype I probes only hybridized to the product amplified from isolates which had Genotype I sequence. Similarly, Genotype II probes only hybridized to the product amplified from isolates which had Genotype II sequence.

In another experiment, PCR products were generated using sequences 79 and 80. The products were analyzed as described above except Sequence No. 73 was used to detect Genotype I, Sequence No. 74 was used to detect Genotype II, Sequence No. 77 (5'UT) was used to detect Genotype III, and Sequence No. 78 (5'UT) was used to detect Genotype IV. Each sequence hybridized in a genotype specific manner.

III. Detection of HCV GI-GIV using a sandwich hybridization assay for HCV RNA

An amplified solution phase nucleic acid sandwich hybridization assay format is described in this example. The assay format employs several nucleic acid probes to effect capture and detection. A capture probe nucleic acid is capable of associating a complementary probe bound to a solid support and HCV nucleic acid to effect capture. A detection probe nucleic acid has a first segment (A) that binds to HCV nucleic acid and a second segment (B) that hybridizes to a second amplifier nucleic acid. The amplifier nucleic acid has a first segment (B*) that hybridizes to segment (B) of the probe nucleic acid and also comprises fifteen iterations of a segment (C). Segment C of the amplifier nucleic acid is capable of hybridizing to three labeled nucleic acids.

Nucleic acid sequences which correspond to nucleotide sequences of the envelope 1 gene of Group I HCV isolates are set forth in sequences numbered 81-99. Table 2 sets forth the area of the HCV genome to which the nucleic acid sequences correspond and a preferred use of the sequences.

                  TABLE 2                                                          ______________________________________                                                                 Complement of                                          Probe Type Sequence No. Nucleotide Numbers                                     ______________________________________                                         Label      81            879-911                                               Label      82            912-944                                               Capture    83            945-977                                               Label      84            978-1010                                              Label      85           1011-1043                                              Label      86           1044-1076                                              Label      87           1077-1109                                              Capture    88           1110-1142                                              Label      89           1143-1175                                              Label      90           1176-1208                                              Label      91           1209-1241                                              Label      92           1242-1274                                              Capture    93           1275-1307                                              Label      94           1308-1340                                              Label      95           1341-1373                                              Label      96           1374-1406                                              Label      97           1407-1439                                              Capture    98           1440-1472                                              Label      99           1473-1505                                              ______________________________________                                    

Nucleic acid sequences which correspond to nucleotide sequences of the envelope 1 gene of Group II HCV isolates are set forth in sequences 100-118. Table 3 sets forth the area of the HCV genome to which the nucleic acid corresponds and the preferred use of the sequences.

                  TABLE 3                                                          ______________________________________                                                                 Complement of                                          Probe Type Sequence No. Nucleotide Numbers                                     ______________________________________                                         Label      100           879-911                                               Label      101           912-944                                               Capture    102           945-977                                               Label      103           978-1010                                              Label      104          1011-1043                                              Label      105          1044-1076                                              Label      106          1077-1109                                              Capture    107          1110-1142                                              Label      108          1143-1175                                              Label      109          1176-1208                                              Label      110          1209-1241                                              Label      111          1242-1274                                              Capture    112          1275-1307                                              Label      113          1308-1340                                              Label      114          1341-1373                                              Label      115          1374-1406                                              Label      116          1407-1439                                              Capture    117          1440-1472                                              Label      118          1473-1505                                              ______________________________________                                    

Nucleic acid sequences which correspond to nucleotide sequences in the C gene and the 5'UT region are set forth in sequences 119-145. Table 4 identifies the sequence with a preferred use.

                  TABLE 4                                                          ______________________________________                                         Probe Type          Sequence No.                                               ______________________________________                                         Capture             119                                                        Label               120                                                        Label               121                                                        Label               122                                                        Capture             123                                                        Label               124                                                        Label               125                                                        Label               126                                                        Capture             127                                                        Label               128                                                        Label               129                                                        Label               130                                                        Capture             131                                                        Label               132                                                        Label               133                                                        Label               134                                                        Label               135                                                        Capture             136                                                        Label               137                                                        Label               138                                                        Label               139                                                        Capture             140                                                        Label               141                                                        Label               142                                                        Label               143                                                        Capture             144                                                        Label               145                                                        ______________________________________                                    

The detection and capture probe HCV-specific segments, and their respective names as used in this assay were as follows.

Capture sequences are sequences numbered 119-122 and 141-144.

Detection sequences are sequences numbered 119-140.

Each detection sequence contained, in addition to the sequences substantially complementary to the HCV sequences, a 5' extension (B) which extension (B) is complementary to a segment of the second amplifier nucleic acid. The extension (B) sequence is identified in the Sequence Listing as Sequence No. 146, and is reproduced below.

    AGGCATAGGACCCGTGTCTT

Each capture sequence contained, in addition to the sequences substantially complementary to HCV sequences, a sequence complementary to DNA bound to a solid phase. The sequence complementary to DNA bound to a solid support was carried downstream from the capture sequence. The sequence complementary to the DNA bound to the support is set forth as Sequence No. 147 and is reproduced below.

    CTTCTTTGGAGAAAGTGGTG

Microtiter plates were prepared as follows. White Microlite 1 Removawell strips (polystyrene microtiter plates, 96 wells/plate) were purchased from Dynatech Inc.

Each well was filled with 200 μl 1 N HCl and incubated at room temperature for 15-20 min. The plates were then washed 4 times with 1× PBS and the wells aspirated to remove liquid. The wells were then filled with 200 μl 1 N NaOH and incubated at room temperature for 15-20 min. The plates were again washed 4 times with 1× PBS and the wells aspirated to remove liquid.

Poly(phe-lys) was purchased from Sigma Chemicals, Inc. This polypeptide has a 1:1 molar ratio of phe:lys and an average m.w. of 47,900 gm/mole. It has an average length of 309 amino acids and contains 155 amines/mole. A 1 mg/ml solution of the polypeptide was mixed with 2M NaCl/1× PBS to a final concentration of 0.1 mg/ml (pH 6.0). A volume of 200 μl of this solution was added to each well. The plate was wrapped in plastic to prevent drying and incubated at 30° C. overnight. The plate was then washed 4 times with 1× PBS and the wells aspirated to remove liquid.

The following procedure was used to couple the nucleic acid, a complementary sequence to Sequence No. 147, to the plates, hereinafter referred to as immobilized nucleic acid. Synthesis of immobilized nucleic acid having a sequence complementary to Sequence No. 133 was described in EPA 883096976. A quantity of 20 mg disuccinimidyl suberate was dissolved in 300 μl dimethyl formamide (DMF). A quantity of 26 OD₂₆₀ units of immobilized nucleic acid was added to 100 μl coupling buffer (50 mM sodium phosphate, pH 7.8). The coupling mixture was then added to the DSS-DMF solution and stirred with a magnetic stirrer for 30 min. An NAP-25 column was equilibrated with 10 mM sodium phosphate, pH 6.5. The coupling mixture DSS-DMF solution was added to 2 ml 10 mM sodium phosphate, pH 6.5, at 4° C. The mixture was vortexed to mix and loaded onto the equilibrated NAP-25 column. DSS-activated immobilized nucleic acid DNA was eluted from the column with 3.5 ml 10 mM sodium phosphate, pH 6.5. A quantity of 5.6 OD₂₆₀ units of eluted DSS-activated immobilized nucleic acid DNA was added to 1500 ml 50 mM sodium phosphate, pH 7.8. A volume of 50 μl of this solution was added to each well and the plates were incubated overnight. The plate was then washed 4 times with 1× PBS and the wells aspirated to remove liquid.

Final stripping of plates was accomplished as follows. A volume of 200 μl of 0.2N NaOH containing 0.5% (w/v) SDS was added to each well. The plate was wrapped in plastic and incubated at 65° C. for 60 min. The plate was then washed 4 times with 1× PBS and the wells aspirated to remove liquid. The stripped plate was stored with desiccant beads at 2-8° C.

Serum samples to be assayed were analyzed using PCR followed by sequence analysis to determine the genotype.

Sample preparation consisted of delivering 50 μl of the serum sample and 150 μl P-K Buffer (2 mg/ml proteinase K in 53 mM Tris-HCl, pH 8.0/0.6 M NaCl/0.06 M sodium citrate/8 mM EDTA, pH 8.0/1.3% SDS/16 μg/ml sonicated salmon sperm DNA/7% formamide/50 fmoles capture probes/160 fmoles detection probes) to each well. Plates were agitated to mix the contents in the well, covered and incubated for 16 hr at 62° C.

After a further 10 minute period at room temperature, the contents of each well were aspirated to remove all fluid, and the wells washed 2× with washing buffer (0.1% SDS/0.015 M NaCl/0.0015 M sodium citrate). The amplifier nucleic acid was then added to each well (50 μl of 0.7 fmole/μl solution in 0..48 M NaCl/0.048 M sodium citrate/0.1% SDS/0.5% "blocking reagent" (Boehringer Mannheim, catalog No. 1096 176)). After covering the plates and agitating to mix the contents in the wells, the plates were incubated for 30 min. at 52° C.

After a further 10 min period at room temperature, the wells were washed as described above.

Alkaline phosphatase label nucleic acid, disclosed in EP 883096976, was then added to each well (50 μl/well of 2.66 fmoles/μl). After incubation at 52° C. for 15 min., and 10 min. at room temperature, the wells were washed twice as above and then 3× with 0.015 M NaCl/0.0015 M sodium citrate.

An enzyme-triggered dioxetane (Schaap et al., Tet. Lett. (1987) 28:1159-1162 and EPA Pub. No. 0254051), obtained from Lumigen, Inc., was employed. A quantity of 50 μl Lumiphos 530 (Lumigen) was added to each well. The wells were tapped lightly so that the reagent would fall to the bottom and gently swirled to distribute the reagent evenly over the bottom. The wells were covered and incubated at 37° C. for 20-40 min.

Plates were then read on a Dynatech ML 1000 luminometer. Output was given as the full integral of the light produced during the reaction.

The assay positively detected each of the serum samples, regardless of genotype.

IV. Expression of the Polypeptide Encoded in Sequences Defined by Differing Genotypes

HCV polypeptides encoded by a sequence within sequences 1-66 are expressed as a fusion polypeptide with superoxide dismutase (SOD). A cDNA carrying such sequences is subcloned into the expression vector pSODcfl (Steimer et al. 1986)).

First, DNA isolated from pSODcfl is treated with BamHI and EcoRI, and the following linker was ligated into the linear DNA created by the restriction enzymes:

    5 GAT CCT GGA ATT CTG ATA AGA CCT TAA GAC TAT TTT AA 3

After cloning, the plasmid containing the insert is isolated.

Plasmid containing the insert is restricted with EcoRI. The HCV cDNA is ligated into this EcoRI linearized plasmid DNA. The DNA mixture is used to transform E. coli strain D1210 (Sadler et al. (1980)). Polypeptides are isolated on gels.

V. Antigenicity of Polypeptides

The antigenicity of polypeptides formed in Section IV is evaluated in the following manner. Polyethylene pins arranged on a block in an 8 12 array (Coselco Mimetopes, Victoria, Australia) are prepared by placing the pins in a bath (20% v/v piperidine in dimethylformamide (DMF)) for 30 minutes at room temperature. The pins are removed, washed in DMF for 5 minutes, then washed in methanol four times (2 min/wash). The pins are allowed to air dry for at least 10 minutes, then washed a final time in DMF (5 Min). 1-Hydroxybenzotriazole (HOBt, 367 mg) is dissolved in DMF (80 μL) for use in coupling Fmoc-protected polypeptides prepared in Section IV.

The protected amino acids are placed in micro-titer plate wells with HOBt, and the pin block placed over the plate, immersing the pins in the wells. The assembly is then sealed in a plastic bag and allowed to react at 25° C. for 18 hours to couple the first amino acids to the pins. The block is then removed, and the pins washed with DMF (2 min.), MeOH (4×, 2 min.), and again with DMF (2 min.) to clean and deprotect the bound amino acids. The procedure is repeated for each additional amino acid coupled, until all octamers are prepared.

The free N-termini are then acetylated to compensate for the free amide, as most of the epitopes are not found at the N-terminus and thus would not have the associated positive charge. Acetylation is accomplished by filling the wells of a microtiter plate with DMF/acetic anhydride/triethylamine (5:2:1 v/v/v) and allowing the pins to react in the wells for 90 minutes at 20° C. The pins are then washed with DMF (2 min.) and MeOH (4×, 2 min.), and air dried for at least 10 minutes.

The side chain protecting groups are removed by treating the pins with trifluoroacetic acid/phenol/dithioethane (95:2.5:1.5, v/v/v) in polypropylene bags for 4 hours at room temperature. The pins are then washed in dichloromethane (2×, 2 min.), 5% di-isopropylethylamine/dichloromethane (2×, 5 min.), dichloromethane. (5 min.), and air-dried for at least 10 minutes. The pins are then washed in water (2 min.), MeOH (18 hours), dried in vacuo, and stored in sealed plastic bags over silica gel. IV.B.15.b Assay of Peptides.

Octamer-bearing pins are treated by sonicating for 30 minutes in a disruption buffer (1% sodium dodecylsulfate, 0.1% 2-mercaptoethanol, 0.1 M NaH2PO4) at 60° C. The pins are then immersed several times in water (60° C.), followed by boiling MeOH (2 min.), and allowed to air dry.

The pins are then precoated for 1 hour at 25° C. in microtiter wells containing 200 μL blocking buffer (1% ovalbumin, 1% BSA, 0.1% Tween, and 0.05% NaN3 in PBS), with agitation. The pins are then immersed in microtiter wells containing 175 μL antisera obtained from human patients diagnosed as having HCV and allowed to incubate at 4° C. overnight. The formation of a complex between polyclonal antibodies of the serum and the polypeptide initiates that the peptides give rise to an immune response in vivo. Such peptides are candidates for the development of vaccines.

Thus, this invention has been described and illustrated. It will be apparent to those skilled in the art that many variations and modifications can be made without departing from the purview of the appended claims and without departing from the teaching and scope of the present invention.

    __________________________________________________________________________     #             SEQUENCE LISTING                                                 - (1) GENERAL INFORMATION:                                                     -    (iii) NUMBER OF SEQUENCES:  147                                           - (2) INFORMATION FOR SEQ ID NO: 1:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH:  340 nuc - #leotides                                               (B) TYPE:  nucleic a - #cid                                                    (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY:  linear                                                -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:  (ATCC # 403 - #94)                                # ns5hcv1 (C) INDIVIDUAL ISOLATE:                                              #1:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGCG ACATCCGTAC GGAGGAGGCA                                  #    80            ACCT CGACCCCCAA GCCCGCGTGG                                  #   120            CGAG AGGCTTTATG TTGGGGGCCC                                  #   160            GGGG AGAACTGCGG CTATCGCAGG                                  #   200            TACT GACAACTAGC TGTGGTAACA                                  #   240            CAAG GCCCGGGCAG CCTGTCGAGC                                  #   280            TGCA CCATGCTCGT GTGTGGCGAC                                  #   320            GTGA AAGCGCGGGG GTCCAGGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO: 2:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH:  340 nuc - #leotides                                               (B) TYPE:  nucleic a - #cid                                                    (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY:  linear                                                -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5i21  (C) INDIVIDUAL ISOLATE:                                              #2:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGCG ACATCCGTAC GGAGGAGGCA                                  #    80            ACCT GGACCCCCAA GCCCGCATGG                                  #   120            TGAG AGGCTTTATG TCGGGGGCCC                                  #   160            GGGG AGAACTGCGG CTACCGCAGG                                  #   200            TACT GACAACTAGC TGTGGTAACA                                  #   240            CAAG GCCCGGGCAG CCTGTCGAGC                                  #   280            TGCA CCATGCTTGT GTGTGGCGAC                                  #   320            GTGA AAGTGCGGGG GTCCAGGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO: 3:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH:  340 nuc - #leotides                                               (B) TYPE:  nucleic a - #cid                                                    (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY:  linear                                                -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5pt1  (C) INDIVIDUAL ISOLATE:                                              #3:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGCG ACATCCGTAC GGAGGAGGCA                                  #    80            ATCT GGACCCCCAA GCCCGCGTGG                                  #   120            TGAG AGGCTTTACG TTGGGGGCCC                                  #   160            GGGG AGAACTGCGG CTACCGCAGG                                  #   200            TACT GACAACTAGC TGTGGTAATA                                  #   240            CAAG GCCCGGGCAG CCTGTCGAGC                                  #   280            TGCA CCATGCTCGT GTGTGGTGAC                                  #   320            GTGA GAGTGCGGGG GTCCAGGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:4:                                             -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH:  340 nuc - #leotides                                               (B) TYPE:  nucleic a - #cid                                                    (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY:  linear                                                -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5gm2  (C) INDIVIDUAL ISOLATE:                                              #4:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACG ACATCCGTAC GGAGGAGGCA                                  #    80            ACCT GGACCCCCAA GCCCGCGTGG                                  #   120            TGAG AGGCTTTATG TTGGGGGCCC                                  #   160            GGGG AAAACTGCGG CTATCGCAGG                                  #   200            TACT GACAACTAGC TGTGGTAACA                                  #   240            TAAG GCCCGGGCAG CCTGTCGAGC                                  #   280            TGCA CCATGCTCGT GTGTGGCGAC                                  #   320            GTGA GAGTGCGGGA GTCCAGGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:5:                                             -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH:  340 nuc - #leotides                                               (B) TYPE:  nucleic a - #cid                                                    (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY:  linear                                                -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5us17 (C) INDIVIDUAL ISOLATE:                                              #5:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGCG ATATCCGTAC GGAGGAGGCA                                  #    80            ACCT GGACCCCCAA GCCCGCGTGG                                  #   120            CGAG AGGCTTTATG TCGGGGGCCC                                  #   160            GGGG AAAACTGCGG CTATCGCAGG                                  #   200            TACT GACAACTAGC TGTGGTAACA                                  #   240            CAAG GCCCAAGCAG CCTGTCGAGC                                  #   280            TGCA CCATGCTCGT GTGTGGCGAC                                  #   320            GTGA AAGTCAGGGA GTCCAGGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:6:                                             -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5sp2  (C) INDIVIDUAL ISOLATE:                                              #6:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGCG ATATCCGTAC GGAGGAGGCA                                  #    80            ACCT GGACCCCGAA GCCCGTGTGG                                  #   120            TGAG AGGCTTTATG TTGGGGGCCC                                  #   160            GGGG AGAACTGCGG CTACCGCAGG                                  #   200            TACT GACGACTAGC TGTGGTAATA                                  #   240            CAAG GCCCGGGCAG CCTGTCGAGC                                  #   280            TGCA CCATGCTCGT GTGTGGCGAC                                  #   320            GCGA AAGTGCGGGG GTCCAGGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:7:                                             -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5j1   (C) INDIVIDUAL ISOLATE:                                              #7:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AATG ACACCCGTGT TGAGGAGTCA                                  #    80            ACTT GGCCCCCGAA GCCAGACAGG                                  #   120            AGAG CGGCTCTATG TCGGGGGTCC                                  #   160            GGGC AGAACTGCGG CTATCGCCGG                                  #   200            TGCT GACGACTAGC TGCGGTAATA                                  #   240            GAAG GCCACAGCGG CCTGTCGAGC                                  #   280            TGCA CGATGCTCGT GAACGGAGAC                                  #   320            GTGA AAGCGCGGGG AACCAAGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:8:                                             -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5k1   (C) INDIVIDUAL ISOLATE:                                              #8:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AATG ACATCCGTGT TGAGGAGTCA                                  #    80            ACTT GGCCCCCGAG GCCAGACAAG                                  #   120            AGAG CGGCTTTACA TCGGGGGCCC                                  #   160            GGGC AGAACTGCGG CTATCGCCGA                                  #   200            TGCT GACGACTAGC TGCGGTAATA                                  #   240            GAAG GCCACTGCGG CCTGTAGAGC                                  #   280            TGCA CGATGCTCGT GTGCGGAGAC                                  #   320            GTGA AAGCGCGGGA ACCCAGGAGG                                  #340               AGTC                                                        - (2) INFORMATION FOR SEQ ID NO:9:                                             -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5k1.1 (C) INDIVIDUAL ISOLATE:                                              #9:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AATG ACATCCGTGT TGAGGAGTCA                                  #    80            CCTT GGCCCCCGAG GCTAGACAGG                                  #   120            AGAG CGGCTTTATA TCGGGGGCCC                                  #   160            GGGC AGAACTGCGG TTATCGCCGG                                  #   200            TACT GACGACCAGC TGCGGTAATA                                  #   240            GAAG GCCTCTGCAG CCTGTCGAGC                                  #   280            TGCA CGATGCTCGT GTGTGGGGAC                                  #   320            GTGA AAGCGCGGGA ACCCAGGAGG                                  #340               AGTC                                                        - (2) INFORMATION FOR SEQ ID NO:10:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5gh6  (C) INDIVIDUAL ISOLATE:                                              #10:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGTG ACATCCGTGT CGAGGAGTCG                                  #    80            ACTT GGCCCCCGAA GCCAGGCAGG                                  #   120            CGAG CGACTTTATA TCGGGGGCCC                                  #   160            GGGC AGAACTGCGG TTATCGCCGG                                  #   200            TGCT GACGACTAGC TGCGGTAATA                                  #   240            GAAG GCCTCTGCAG CCTGTCGAGC                                  #   280            TGCA CGATGCTCGT GAACGGGGAC                                  #   320            GCGA GAGCGCGGGA ACCCAAGAGG                                  #340               AGTC                                                        - (2) INFORMATION FOR SEQ ID NO:11:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5sp1  (C) INDIVIDUAL ISOLATE:                                              #11:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGTG ACATCCGTGT TGAGGAGTCA                                  #    80            ACTT GGCCCCCGAA GCCAGACAGG                                  #   120            AGAG CGGCTGTACA TCGGGGGTCC                                  #   160            GGGC AGAACTGCGG CTATCGCCGG                                  #   200            TGCT GACGACTAGC TGCGGTAACA                                  #   240            GAAG GCCTCTGCGG CCTGTCGAGC                                  #   280            TGCA CGATGCTCGT GTGCGGTGAC                                  #   320            GTGA GAGCGCGGGA ACCCAAGAGG                                  #340               AGTC                                                        - (2) INFORMATION FOR SEQ ID NO:12:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5sp3  (C) INDIVIDUAL ISOLATE:                                              #12:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGTG ACATCCGTGT TGAGGAGTCA                                  #    80            ACTT GGCCCCCGAA GCCAGACAGG                                  #   120            AGAG CGGCTTTACA TCGGGGGTCC                                  #   160            GGGC AGAACTGCGG CTATCGCCGG                                  #   200            TGCT GACGACTAGC TGCGGTAATA                                  #   240            GAAG GCCAGTGCGG CCTGTCGAGC                                  #   280            TGCA CAATGCTCGT GTGCGGTGAC                                  #   320            GTGA GAGCGCGGGG ACCCAAGAGG                                  #340               AGTC                                                        - (2) INFORMATION FOR SEQ ID NO:13:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5k2   (C) INDIVIDUAL ISOLATE:                                              #13:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGAG ACATCAGAAC TGAGGAGTCC                                  #    80            CCCT GCCTGAGGAG GCTCACATTG                                  #   120            TGAG AGGCTCTACG TGGGAGGGCC                                  #   160            GGCC AGACCTGCGG GTACAGGCGT                                  #   200            TGCT CACCACTAGC ATGGGGAACA                                  #   240            AAAA GCCCTAGCGG CTTGCAAGGC                                  #   280            CCCT CAATGCTGGT ATGCGGCGAC                                  #   320            CAGA AAGCCAGGGG ACTGAGGAGG                                  #340               AGCT                                                        - (2) INFORMATION FOR SEQ ID NO:14:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5arg8 (C) INDIVIDUAL ISOLATE:                                              #14:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AAGG ACATCACATC CTAGGAGTCC                                  #    80            CACT GCCCGAGGAG GCTCGAACTG                                  #   120            TGAG AGACTATACG TAGGGGGGCC                                  #   160            GGCC AATCCTGCGG GTACAGGCGT                                  #   200            TGCT CACCACCAGC ATGGGCAACA                                  #   240            AAAA GCCAGGGCGG CGTGTAACGC                                  #   280            CCCA CCATGCTGGT GTGCGGTGAC                                  #   320            CAGA GAGTCAAGGG GCTGAGGAGG                                  #340               AGTC                                                        - (2) INFORMATION FOR SEQ ID NO:15:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5i10  (C) INDIVIDUAL ISOLATE:                                              #15:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGGG ACATCAGAAC CGAGGAGTCC                                  #    80            CACT GCCTGAGGAG GCCCGAACTG                                  #   120            TGAG AGACTGTACG TAGGGGGGCC                                  #   160            GGGC AATCCTGCGG GTACAGGCGT                                  #   200            TGCT CACCACCAGC ATGGGCAACA                                  #   240            GAAA GCCAGAGCGG CGTGTAACGC                                  #   280            CCCA CCATGTTGGT GTGCGGCGAC                                  #   320            CAGA GAGTCAGGGG GTCGAGGAAG                                  #340               AGTC                                                        - (2) INFORMATION FOR SEQ ID NO:16:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5arg6 (C) INDIVIDUAL ISOLATE:                                              #16:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AGGG ACATCAGAAC CGAGGAGTCC                                  #    80            CACT GCCTGAGGAG GCTCGAACTG                                  #   120            TGAG AGGCTGTACG TAGGGGGGCC                                  #   160            GGGC AATCCTGCGG GTACAGGCGT                                  #   200            TGCT CACCACCAGC ATGGGTAACA                                  #   240            GAAA GCTAAAGCGG CATGTAACGC                                  #   280            CCCA CCATGTTGGT GTGCGGCGAC                                  #   320            CAGA GAGTCAAGGG GTCGAGGAGG                                  #340               AGCT                                                        - (2) INFORMATION FOR SEQ ID NO:17:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5k2b  (C) INDIVIDUAL ISOLATE:                                              #17:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #   40             AGGG ACATAAGAAC AGAAGAATCC                                  #   80             CCCT GCCTCAGGAG GCTAGAACTG                                  #   120            TGAG AGACTCTACG TAGGAGGGCC                                  #   160            GGAC AATCCTGCGG TTACAGGCGT                                  #   200            TCTT CACCACCAGC ATGGGGAATA                                  #   240            CAAA GCCCTTGCAG CGTGCAAAGC                                  #   280            CCTA TCATGCTGGT GTGTGGAGAC                                  #   320            CGGA GAGCGAAGGT AACGAGGAGG                                  #340               AGCT                                                        - (2) INFORMATION FOR SEQ ID NO:18:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5sa283(C) INDIVIDUAL ISOLATE:                                              #18:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            CATG ACATAATGAC TGAAGAGTCT                                  #    80            ACTT GCAGCCTGAG GCGCGTGTGG                                  #   120            CCAA CGCCTGTACT GTGGAGGCCC                                  #   160            GGGC AACAATGTGG TTATCGTAGA                                  #   200            TCTT CACCACTAGT ATGGGCAACA                                  #   240            TAAG GCTTTAGCCT CCTGTAGAGC                                  #   280            TGCA CGCTCCTGGT GTGTGGTGAT                                  #   320            GCGA GAGCCAGGGG ACGCACGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:19:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5sa156(C) INDIVIDUAL ISOLATE:                                              #19:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            CATG ACATAATGAC TGAAGAGTCC                                  #    80            ACTT GCAGCCTGAG GCACGCGCGG                                  #   120            CCAA CGCCTGTACT GTGGAGGCCC                                  #   160            GGGC AACAATGTGG TTACCGTAGA                                  #   200            TCTT CACCACCAGT ATGGGCAACA                                  #   240            CAAG GCTTCAGCCG CCTGTAGAGC                                  #   280            TGCA CGCTCCTGGT GTGTGGTGTG                                  #   320            GCGA GAGCCAAGGG ACGCACGAGG                                  #340               AGTC                                                        - (2) INFORMATION FOR SEQ ID NO:20:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5i11  (C) INDIVIDUAL ISOLATE:                                              #20:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            CAGG ACATCAGGGT GGAAGAGGAG                                  #    80            ACCT TGAACCGGAG GCCAGGAAAG                                  #   120            GGAG CGGCTTTACT GCGGGGGCCC                                  #   160            GGGG CCCAGTGTGG TTATCGCCGT                                  #   200            TCCT GCCTACCAGC TTCGGCAACA                                  #   240            CAAG GCTAGAGCGG CTTCGAAGGC                                  #   280            CCGG ACTTTCTTGT CTGCGGAGAT                                  #   320            CTGA GAGTGATGGC GTCGACGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:21:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5i4   (C) INDIVIDUAL ISOLATE:                                              #21:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            CAGG ACATCAGGGT GGAAGAGGAG                                  #    80            ACCT TGAACCGGAG GCCAGGAAAG                                  #   120            GGAG CGGCTTTACT GCGGGGGCCC                                  #   160            GGGG CCCAGTGTGG TTATCGCCGT                                  #   200            TTCT GCCTACCAGC TTCGGCAACA                                  #   240            CAAG GCTAGAGCGG CTGCGAAGGC                                  #   280            CCGG ACTTTCTCGT CTGCGGAGAT                                  #   320            CTGA GAGTGATGGC GTCGACGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:22:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 340 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ns5gh8  (C) INDIVIDUAL ISOLATE:                                              #22:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            CAGG ACATCAGGGT GGAAGAGGAG                                  #    80            ACCT TGAACCGGAG GCCAGGAAAG                                  #   120            GGAA CGGCTTTACT GCGGGGGCCC                                  #   160            GGGG CCCAGTGTGG TTATCGCCGT                                  #   200            TTCT GCCTACCAGC TTCGGCAACA                                  #   240            CAAA GCTAGAGCGG CTGCCGAAGC                                  #   280            CCGG ACTTTCTTGT CTGCGGAGAT                                  #   320            CTGA GAGTGATGGC GTCAATGAGG                                  #340               AGCC                                                        - (2) INFORMATION FOR SEQ ID NO:23:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:  (ATCC # 403 - #94)                                # hcv1    (C) INDIVIDUAL ISOLATE:                                              #23:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            GCTC AGCTGCTCCG GATCCCACAA                                  #    80            TCGC TGGTGCTCAC TGGGGAGTCC                                  #100               TTTC                                                        - (2) INFORMATION FOR SEQ ID NO:24:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # US5     (C) INDIVIDUAL ISOLATE:                                              #24:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            GCTC AGGTACTCCG GATCCCACAA                                  #    80            TCGC TGGAGCCCAC TGGGGAGTCC                                  #100               TTTC                                                        - (2) INFORMATION FOR SEQ ID NO:25:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # AUS5    (C) INDIVIDUAL ISOLATE:                                              #25:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            GCTC AGCTGCTCAG GGTCCCGCAA                                  #    80            TCGC TGGTGCCCAC TGGGGAGTCC                                  #100               TTTT                                                        - (2) INFORMATION FOR SEQ ID NO:26:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # US4     (C) INDIVIDUAL ISOLATE:                                              #26:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TCGC AGTTACTCCG GATCCCACAA                                  #    80            TGGC GGGGGCCCAC TGGGGAGTCC                                  #100               CTAT                                                        - (2) INFORMATION FOR SEQ ID NO:27:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # ARG2    (C) INDIVIDUAL ISOLATE:                                              #27:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TCGC AGTTACTCCG GATCCCACAA                                  #    80            TGGC GGGGGCCCAC TGGGGAGTCC                                  #100               CTAT                                                        - (2) INFORMATION FOR SEQ ID NO:28:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # I15     (C) INDIVIDUAL ISOLATE:                                              #28:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TCGC AGTTACTCCG GATCCCGCAA                                  #    80            TGGC GGGGGCCCAC TGGGGAATCC                                  #100               CTAT                                                        - (2) INFORMATION FOR SEQ ID NO:29:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # GH8     (C) INDIVIDUAL ISOLATE:                                              #29:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            GCGC ACGTCCTGCG TTTGCCCCAG                                  #    80            TAGC CGGGGCCCAT TGGGGCATCT                                  #100               TTAC                                                        - (2) INFORMATION FOR SEQ ID NO:30:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # I4      (C) INDIVIDUAL ISOLATE:                                              #30:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            GCAC ACGTCCTGCG TCTGCCCCAG                                  #    80            TAGC CGGGGCCCAT TGGGGCATCT                                  #100               TTAC                                                        - (2) INFORMATION FOR SEQ ID NO:31:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # I11     (C) INDIVIDUAL ISOLATE:                                              #31:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            GCGC AAGTCCTGCG TTTGCCCCAG                                  #    80            TAGC CGGGGCCCAT TGGGGCATCT                                  #100               TTAC                                                        - (2) INFORMATION FOR SEQ ID NO:32:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 100 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # I10     (C) INDIVIDUAL ISOLATE:                                              #32:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            GCAT ACTTGGTGCG CATCCCGGAG                                  #    80            TCAC GGGAGGACAC TGGGGCGTGA                                  #100               TTTC                                                        - (2) INFORMATION FOR SEQ ID NO:33:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:  (ATCC # 403 - #94)                                # hcv1    (C) INDIVIDUAL ISOLATE:                                              #33:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCAAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:34:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # us5     (C) INDIVIDUAL ISOLATE:                                              #34:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCAAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:35:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # aus1    (C) INDIVIDUAL ISOLATE:                                              #35:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCACGCCCC CGCAAGATCA                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:36:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # sp2     (C) INDIVIDUAL ISOLATE:                                              #36:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATAAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:37:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # gm2     (C) INDIVIDUAL ISOLATE:                                              #37:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCAAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:38:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # i21     (C) INDIVIDUAL ISOLATE:                                              #38:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATAAACCC                                  #   160            TTTG GGCGTGCCCC CGCAAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:39:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # us4     (C) INDIVIDUAL ISOLATE:                                              #39:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:40:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # jh1     (C) INDIVIDUAL ISOLATE:                                              #40:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:41:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # nac5    (C) INDIVIDUAL ISOLATE:                                              #41:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:42:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # arg2    (C) INDIVIDUAL ISOLATE:                                              #42:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:43:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # sp1     (C) INDIVIDUAL ISOLATE:                                              #43:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:44:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # gh1     (C) INDIVIDUAL ISOLATE:                                              #44:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:45:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # i15     (C) INDIVIDUAL ISOLATE:                                              #45:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATCAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:46:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # i10     (C) INDIVIDUAL ISOLATE:                                              #46:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TACA GCCTCCAGGC CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CTGG GTCCTTTCTT GGATAAACCC                                  #   160            TTTG GGCGTGCCCC CGCAAGACTG                                  #   200            GGGT TGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:47:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # arg6    (C) INDIVIDUAL ISOLATE:                                              #47:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TACA GCCTCCAGGC CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CTGG GTCCTTTCTT GGATAAACCC                                  #   160            TTTG GGCGTGCCCC CGCAAGACTG                                  #   200            GGGT TGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:48:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # s21     (C) INDIVIDUAL ISOLATE:                                              #48:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CTCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGAGCAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGATCA                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:49:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 252 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # gj61329 (C) INDIVIDUAL ISOLATE:                                              #49:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            TGCA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGAGTAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGATCA                                  #   200            GGGT CGCGAAAGGC CTTGTGGTAC                                  #   240            TGCG AGTGCCCCGG GAGGTCTCGT                                  #      252                                                                     - (2) INFORMATION FOR SEQ ID NO:50:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 180 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # sa3     (C) INDIVIDUAL ISOLATE:                                              #50:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACA GCCTCCAGGA CCCCCCCTCC                                  #    80            GTCT GCGGAACCGG TGAGTACACC                                  #   120            CCGG GTCCTTTCTT GGATAAACCC                                  #   160            TTTG GGCGTGCCCC CGCGAGACTG                                  #180               GGGT                                                        - (2) INFORMATION FOR SEQ ID NO:51:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 180 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # sa4     (C) INDIVIDUAL ISOLATE:                                              #51:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    - GTTAGTATGA GTGTCGAACA GCCTCCAGGA CCCCCCCTCC    40                            - CGGGAGAGCC ATAGTGGTCT GCGGAACCGG TGAGTACACC    80                            - GGAATTGCCG GGATGACCGG GTCCTTTCTT GGATAAACCC   120                            - GCTCAATGCC CGGAGATTTG GGCGTGCCCC CGCGAGACTG   160                            #             180TTGGGT                                                        - (2) INFORMATION FOR SEQ ID NO:52:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:  (ATCC # 403 - #94)                                # hcv1    (C) INDIVIDUAL ISOLATE:                                              #52:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAAAAAA AACAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGTGG                                  #   120            GGAG TTTACTTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACGAGA AAGACTTCCG                                  #   200            AGGT AGACGTCAGC CTATCCCCAA                                  #   240            GGCA GGACCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGC TGCGGGTGGG                                  #   320            TCCC CGTGGCTCTC GGCCTAGCTG                                  #   360            CGGC GTAGGTCGCG CAATTTGGGT                                  #   400            TTAC GTGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCTC TTGGAGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAAGAC                                  #   520            CAGG GAACCTTCCT GGTTGCTCTT                                  #           549    GGCC CTGCTCTCT                                              - (2) INFORMATION FOR SEQ ID NO:53:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # us5     (C) INDIVIDUAL ISOLATE:                                              #53:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGTGG                                  #   120            GGAG TTTACTTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACGAGG AAGACTTCCG                                  #   200            AGGT AGACGTCAGC CTATCCCCAA                                  #   240            GGCA GGACCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGT TGCGGGTGGG                                  #   320            TCCC CGTGGCTCTC GGCCTAGTTG                                  #   360            CGGC GTAGGTCGCG CAATTTGGGT                                  #   400            TTAC GTGCGGCTTC GCCGACCACA                                  #   440            CGTC GGCGCCCCTC TTGGAGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAAGAC                                  #   520            CAGG GAACCTTCCT GGTTGCTCTT                                  #           549    GGCC CTGCTCTCT                                              - (2) INFORMATION FOR SEQ ID NO:54:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # aus1    (C) INDIVIDUAL ISOLATE:                                              #54:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTTAAGT TCCCGGGTGG                                  #   120            GGAG TTTACTTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACGAGG AAGACTTCCG                                  #   200            AGGT AGACGTCAGC CTATCCCTAA                                  #   240            GGCA GGACCTGGGC TCAGCCCGGG                                  #   280            ATGG TAATGAGGGT TGCGGATGGG                                  #   320            CCCC CGTGGCTCTC GGCCTAGTTG                                  #   360            CGGC GTAGGTCGCG CAATTTGGGT                                  #   400            TCAC GTGCGGCTTC GCCGACCACA                                  #   440            CGTT GGCGCCCCTC TTGGGGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAAGAC                                  #   520            CAGG GAATCTTCCT GGTTGCTCTT                                  #           549    GGCC CTTCTCTCT                                              - (2) INFORMATION FOR SEQ ID NO:55:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # sp2     (C) INDIVIDUAL ISOLATE:                                              #55:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGTGG                                  #   120            GGAG TTTACTTGTT GCCGCGCAGG                                  #   160            TGCG CACGACGAGG AAGACTTCCG                                  #   200            AGGT AGACGTCAGC CCATCCCCAA                                  #   240            GGCA GGACCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGC TGCGGGTGGG                                  #   320            TCCC CGTGGCTCTC GGCCTAGCTG                                  #   360            CGGC GTAGGTCGCG CAATTTGGGT                                  #   400            TTAC GTGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCTC TTGGAGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAAGAC                                  #   520            CAGG GAACCTTCCC GGTTGCTCTT                                  #           549    GGCC CTGCTCTCT                                              - (2) INFORMATION FOR SEQ ID NO:56:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # gm2     (C) INDIVIDUAL ISOLATE:                                              #56:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAGA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGTGG                                  #   120            GGAG TTTACTTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACGAGG AAGACTTCCG                                  #   200            AGGT AGACGTCAGC CTATCCCCAA                                  #   240            GGTA GGACCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGT TGCGGGTGGG                                  #   320            TCCC CGCGGCTCTC GGCCTAACTG                                  #   360            CGGC GTAGGTCGCG CAATTTGGGT                                  #   400            TTAC GTGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCTC TTGGAGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAAGAC                                  #   520            CAGG GAACCTTCCT GGTTGCTCTT                                  #           549    GGCC CTGCTCTCT                                              - (2) INFORMATION FOR SEQ ID NO:57:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # i21     (C) INDIVIDUAL ISOLATE:                                              #57:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGTGG                                  #   120            GGAG TTTACTTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACGAGG AAGACTTCCG                                  #   200            TGGT AGACGCCAGC CTATCCCCAA                                  #   240            GGCA GGACCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGT TGCGGGTGGG                                  #   320            TCCC CGTGGCTCTC GGCCTAGCTG                                  #   360            CGGC GTAGGTCGCG CAATTTGGGT                                  #   400            TTAC GTGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCTC TTGGAGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAAGAC                                  #   520            CAGG GAACCTTCCT GGTTGCTCTT                                  #           549    GGCC CTGCTCTCT                                              - (2) INFORMATION FOR SEQ ID NO:58:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # us4     (C) INDIVIDUAL ISOLATE:                                              #58:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTTAAGT TCCCGGGCGG                                  #   120            GGAG TTTACCTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACTAGG AAGACTTCCG                                  #   200            TGGA AGGCGACAAC CTATCCCCAA                                  #   240            GGCA GGGCCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGT ATGGGGTGGG                                  #   320            ACCC CGTGGCTCTC GGCCTAGTTG                                  #   360            CGGC GTAGGTCGCG TAATTTGGGT                                  #   400            TCAC ATGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCCC TTAGGGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAGGAC                                  #   520            CAGG GAATCTGCCC GGTTGCTCCT                                  #           549    GGCT CTGCTGTCC                                              - (2) INFORMATION FOR SEQ ID NO:59:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # jh1     (C) INDIVIDUAL ISOLATE:                                              #59:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGCGG                                  #   120            GGAG TTTACCTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACTAGG AAGACTTCCG                                  #   200            TGGA AGGCGACAAC CTATCCCCAA                                  #   240            GGCA GGGCCTGGGC TCAGCCCGGG                                  #   280            ATGG CAACGAGGGT ATGGGGTGGG                                  #   320            ACCC CGTGGCTCTC GGCCTAGTTG                                  #   360            CGGC GTAGGTCGCG TAATTTGGGT                                  #   400            TCAC ATGCGGCTTC GCCGACCTCA                                  #   440            TGTC GGCGCCCCCC TAGGGGGCGC                                  #   480            CATG GTGTCCGGGT TCTGGAGGAC                                  #   520            CAGG GAATTTGCCC GGTTGCTCTT                                  #           549    GGCT CTGCTGTCC                                              - (2) INFORMATION FOR SEQ ID NO:60:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # nac5    (C) INDIVIDUAL ISOLATE:                                              #60:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC CCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGCGG                                  #   120            GGAG TTTACCTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACTAGG AAGACTTCCG                                  #   200            TGGA AGGCGACAAC CTATCCCCAA                                  #   240            GGCA GGTCCTGGGC TCAGCCCGGG                                  #   280            ATGG CAACGAGGGT ATGGGGTGGG                                  #   320            ACCC CGCGGCTCCC GGCCTAGTTG                                  #   360            CGGC GTAGGTCGCG TAATTTGGGT                                  #   400            TCAC ATGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCCC TAGGGGGCGC                                  #   480            CATG GTGTCCGGGT TCTGGAGGAC                                  #   520            CAGG GAATTTGCCT GGTTGCTCTT                                  #           549    GGCT CTGCTGTCC                                              - (2) INFORMATION FOR SEQ ID NO:61:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # arg2    (C) INDIVIDUAL ISOLATE:                                              #61:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGCGG                                  #   120            GGAG TTTACTTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACTAGG AAGACTTCCG                                  #   200            TGGA AGGCGACAAC CTATCCCCAA                                  #   240            GGTA GGGCCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGT ATGGGGTGGG                                  #   320            CCCC CGCGGCTCCC GGCCTAGTTG                                  #   360            CGGC GTAGGTCGCG TAATTTGGGT                                  #   400            TCAC ATGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCCC TAGGGGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAGGAC                                  #   520            CAGG GAATCTGCCC GGTTGCTCTT                                  #           549    GGCT TTGCTGTCC                                              - (2) INFORMATION FOR SEQ ID NO:62:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # sp1     (C) INDIVIDUAL ISOLATE:                                              #62:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGCGG                                  #   120            GGAG TTTACCTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACTAGG AAGACTTCCG                                  #   200            TGGA AGGCGACAAC CTATCCCCAA                                  #   240            GGCA GGGCCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGT CTGGGGTGGG                                  #   320            ACCC CGCGGCTCTC GGCCTAGCTG                                  #   360            CGGC GTAGGTCGCG CAACTTGGGT                                  #   400            TTAC GTGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCCC TTAGGGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAGGAC                                  #   520            CAGG GAATTTGCCC GGTTGCTCTT                                  #           549    GGCT TTGCTGTCC                                              - (2) INFORMATION FOR SEQ ID NO:63:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # gh1     (C) INDIVIDUAL ISOLATE:                                              #63:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGCGG                                  #   120            GGAG TTTACTTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACTAGG AAGACTTCCG                                  #   200            TGGA AGGCGACAAC CTATCCCCAA                                  #   240            GGCA GGGCCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGT ATGGGGTGGG                                  #   320            ACCC CGTGGTTCTC GGCCTAGTTG                                  #   360            CGGC GTAGGTCGCG CAATTTGGGT                                  #   400            TCAC GTGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCCC TAGGGGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAGGAC                                  #   520            CAGG GAATCTGCCC GGTTGCTCCT                                  #           549    GGCT TTGCTGTCC                                              - (2) INFORMATION FOR SEQ ID NO:64:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # i15     (C) INDIVIDUAL ISOLATE:                                              #64:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAACGTA                                  #    80            ACAG GACGTCAAGT TCCCGGGCGG                                  #   120            GGAG TTTACCTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACTAGG AAGACTTCCG                                  #   200            TGGA AGGCGACAAC CTATCCCCAA                                  #   240            GGCA GGGCCTGGGC TCAGCCCGGG                                  #   280            ATGG CAATGAGGGT ATGGGGTGGG                                  #   320            ACCC CGCGGCTCCC GGCCTAGTTG                                  #   360            CGGC GTAGGTCGCG TAATTTGGGT                                  #   400            TCAC ATGCGGCTTC GCCGACCTCA                                  #   440            CGTC GGCGCCCCCT TAGGGGGCGC                                  #   480            CATG GCGTCCGGGT TCTGGAGGAC                                  #   520            CAGG GAATCTACCC GGTTGCTCTT                                  #           549    GGCT TTGCTGTCC                                              - (2) INFORMATION FOR SEQ ID NO:65:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 549 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # i10     (C) INDIVIDUAL ISOLATE:                                              #65:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAAAGAA                                  #    80            ACAG GACGTCAAGT TCCCGGGCGG                                  #   120            GGAG TATACTTGCT GCCGCGCAGG                                  #   160            TGCG CGCGACGAGG AAAACTTCCG                                  #   200            CGGA AGGCGTCAGC CCATCCCTAA                                  #   240            GGCA AGTCCTGGGG AAGGCCAGGA                                  #   280            ATGG GAATGAGGGT CTCGGCTGGG                                  #   320            CCCC CGTGGCTCTC GCCCTTCATG                                  #   360            CGGC ATAGATCGCG CAACTTGGGT                                  #   400            TAAC GTGCGGTTTT GCCGACCTCA                                  #   440            CATC GGCGCCCCCG TTGGAGGCGT                                  #   480            CACG GAGTGAGGGT TCTGGAGGAT                                  #   520            CAGG GAATTTGCCC GGTTGCTCTT                                  #           549    AGCC CTCTTGTCT                                              - (2) INFORMATION FOR SEQ ID NO:66:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 510 nucleot - #ides                                                (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 -     (vi) ORIGINAL SOURCE:                                                    # arg6    (C) INDIVIDUAL ISOLATE:                                              #66:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #    40            AACC TCAAAGAAAA ACCAAAAGAA                                  #    80            ACAG GACGTCAAGT TCCCGGGCGG                                  #   120            GGAG TATACTTGTT GCCGCGCAGG                                  #   160            TGCG CGCGACGAGG AAAACTTCCG                                  #   200            TGGG AGGCGCCAGC CCATCCCCAA                                  #   240            GGCA AGTCCTGGGG GAAGCCAGGA                                  #   280            ATGG GAATGAGGGT CTCGGCTGGG                                  #   320            CCCC CGCGGTTCTC GCCCTTCATG                                  #   360            CGGC ATAGATCACG CAACTTGGGT                                  #   400            TAAC GTGTGGTTTT GCCGACCTCA                                  #   440            CGGT GGTGCCCCCG TTGGTGGTGT                                  #   480            CATG GGGTGAGGGT TCTGGAAGAC                                  #          510     CAGG GAATCTGCCC                                             - NFORMATION FOR SEQ ID NO:67:                                                 -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 29 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #67:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #            29    CGRC GCCCACAGG                                              - (2) INFORMATION FOR SEQ ID NO:68:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 24 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #68:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #                24CCCC CACG                                                   - (2) INFORMATION FOR SEQ ID NO:69:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 30 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #69:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #           30     CGTC AGCCTATCCC                                             - (2) INFORMATION FOR SEQ ID NO:70:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 30 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #70:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #           30     CGAC AACCTATCCC                                             - (2) INFORMATION FOR SEQ ID NO:71:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 30 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #71:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #           30     CTAA CTCGAGTATT                                             - (2) INFORMATION FOR SEQ ID NO:72:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 26 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #72:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #              26  CCAA CTCAAG                                                 - (2) INFORMATION FOR SEQ ID NO:73:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 28 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #73:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #             28   WGCY CACTGGGG                                               - (2) INFORMATION FOR SEQ ID NO:74:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 28 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #74:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #             28   GGCY CACTGGGG                                               - (2) INFORMATION FOR SEQ ID NO:75:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 20 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #75:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    # 20               CYAC                                                        - (2) INFORMATION FOR SEQ ID NO:76:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 26 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #76:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #              26  CCRC CATGGA                                                 - (2) INFORMATION FOR SEQ ID NO:77:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 22 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #77:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #                 22GYC AT                                                     - (2) INFORMATION FOR SEQ ID NO:78:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 18 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #78:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #  18              CG                                                          - (2) INFORMATION FOR SEQ ID NO:79:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 28 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #79:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #             28   TGTG AGGAACTA                                               - (2) INFORMATION FOR SEQ ID NO:80:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 18 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #80:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #  18              AA                                                          - (2) INFORMATION FOR SEQ ID NO:81:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #81:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ARRC AAGARAGCAG GGC                                         - (2) INFORMATION FOR SEQ ID NO:82:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #82:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TTGC GCACTTGGTR GGC                                         - (2) INFORMATION FOR SEQ ID NO:83:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #83:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CAAT CATTGGTGAC RTG                                         - (2) INFORMATION FOR SEQ ID NO:84:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #84:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TCRK BCGYCTCGTA CAC                                         - (2) INFORMATION FOR SEQ ID NO:85:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #85:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CAAG GGACRCACCC CGG                                         - (2) INFORMATION FOR SEQ ID NO:86:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #86:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ACCC AACACCTCGA GRC                                         - (2) INFORMATION FOR SEQ ID NO:87:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #87:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CCRT CCCTGGTGGC YAC                                         - (2) INFORMATION FOR SEQ ID NO:88:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #88:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ATGT GACGTCGAAG CTG                                         - (2) INFORMATION FOR SEQ ID NO:89:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #89:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GARC AGAGRGTGGC GCY                                         - (2) INFORMATION FOR SEQ ID NO:90:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #90:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ACAG ACCCGCAYAR GTC                                         - (2) INFORMATION FOR SEQ ID NO:91:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #91:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GGAG AGAAGGTGAA CAG                                         - (2) INFORMATION FOR SEQ ID NO:92:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #92:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CAAT TGCARYCTTG CGT                                         - (2) INFORMATION FOR SEQ ID NO:93:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #93:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CGGT GACCCGTTAY ATG                                         - (2) INFORMATION FOR SEQ ID NO:94:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #94:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GGGG ACCARTTCAT CAT                                         - (2) INFORMATION FOR SEQ ID NO:95:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #95:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CGGA GYASCTGAGC YAY                                         - (2) INFORMATION FOR SEQ ID NO:96:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #96:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CCAG CGATCATRTC CAW                                         - (2) INFORMATION FOR SEQ ID NO:97:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #97:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TACG CTATGCCCGC YAG                                         - (2) INFORMATION FOR SEQ ID NO:98:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #98:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ARGA CCTTCGCCCA GTT                                         - (2) INFORMATION FOR SEQ ID NO:99:                                            -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #99:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GCGT CRACGCCGGC RAA                                         - (2) INFORMATION FOR SEQ ID NO:100:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #100: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ARRC ARGASAGCAR AGC                                         - (2) INFORMATION FOR SEQ ID NO:101:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #101: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TTGC GCACTTCRTA AGC                                         - (2) INFORMATION FOR SEQ ID NO:102:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #102: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CART CGTTYGTGAC ATG                                         - (2) INFORMATION FOR SEQ ID NO:103:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #103: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TCCG YYGCCTCATA CAC                                         - (2) INFORMATION FOR SEQ ID NO:104:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #104: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CARG GCACGCACCC RGG                                         - (2) INFORMATION FOR SEQ ID NO:105:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #105: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ACCC AGCARCGGGA GSW                                         - (2) INFORMATION FOR SEQ ID NO:106:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #106: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       KHRT TCCTGGCCGC VAR                                         - (2) INFORMATION FOR SEQ ID NO:107:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #107: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ACRT GRCGTCGTAW TGT                                         - (2) INFORMATION FOR SEQ ID NO:108:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #108: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GAMS AGARRGYAGC CGY                                         - (2) INFORMATION FOR SEQ ID NO:109:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #109: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ACAG ATCCGCARAG RTC                                         - (2) INFORMATION FOR SEQ ID NO:110:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #110: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GSBG AGAAGGTGAA YAG                                         - (2) INFORMATION FOR SEQ ID NO:111:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #111: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CART TGCAKTCCTG YAC                                         - (2) INFORMATION FOR SEQ ID NO:112:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #112: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CGRT GGCCTGAYAC CTG                                         - (2) INFORMATION FOR SEQ ID NO:113:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #113: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GGYG ACCAGTTCAT CAT                                         - (2) INFORMATION FOR SEQ ID NO:114:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #114: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CGGA GTAACTGCGA YAC                                         - (2) INFORMATION FOR SEQ ID NO:115:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #115: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CCCG CCACCATRTC CAT                                         - (2) INFORMATION FOR SEQ ID NO:116:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #116: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TAGG CAAGGCCCGC YAG                                         - (2) INFORMATION FOR SEQ ID NO:117:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #117: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       AADA CCTTAGCCCA GTT                                         - (2) INFORMATION FOR SEQ ID NO:118:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #118: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CCGT CAACGCCGGC AAA                                         - (2) INFORMATION FOR SEQ ID NO:119:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #119: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ATTC ATGGTGGAGT GTC                                         - (2) INFORMATION FOR SEQ ID NO:120:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #120: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TGCG TGAAGACAGT AGT                                         - (2) INFORMATION FOR SEQ ID NO:121:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #121: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       RCAC TCATACTAAC GCC                                         - (2) INFORMATION FOR SEQ ID NO:122:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #122: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TCTY CCGGGAGGGG GGG                                         - (2) INFORMATION FOR SEQ ID NO:123:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #123: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GGTG TACTCACCGG TTC                                         - (2) INFORMATION FOR SEQ ID NO:124:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #124: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TCCA AGAAAGGACC CGG                                         - (2) INFORMATION FOR SEQ ID NO:125:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #125: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CACG CCCAARTCTC CAG                                         - (2) INFORMATION FOR SEQ ID NO:126:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #126: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CCCA ACACTACTCG GCT                                         - (2) INFORMATION FOR SEQ ID NO:127:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #127: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CCCT ATCAGGCAGT ACC                                         - (2) INFORMATION FOR SEQ ID NO:128:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #128: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GGTC TACGAGACCT CCC                                         - (2) INFORMATION FOR SEQ ID NO:129:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #129: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       YTTT GRGGTTTRGG AWT                                         - (2) INFORMATION FOR SEQ ID NO:130:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #130: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TGTG GGCGRCGGTT GGT                                         - (2) INFORMATION FOR SEQ ID NO:131:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #131: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       ACGA TCTGRCCRCC RCC                                         - (2) INFORMATION FOR SEQ ID NO:132:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #132: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GGGC CCCTGCGCGG CAA                                         - (2) INFORMATION FOR SEQ ID NO:133:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #133: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GAAG TCTTYCTRGT CGC                                         - (2) INFORMATION FOR SEQ ID NO:134:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #134: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GGCT GACGTCWACC TCG                                         - (2) INFORMATION FOR SEQ ID NO:135:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #135: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GGTT GTCGCCWTCC ACG                                         - (2) INFORMATION FOR SEQ ID NO:136:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #136: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       RYCC TRCCCTCGGR YYG                                         - (2) INFORMATION FOR SEQ ID NO:137:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #137: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TAGA GGGGCCADGG RTA                                         - (2) INFORMATION FOR SEQ ID NO:138:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #138: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       AGCC ATCCYGCCCA CCC                                         - (2) INFORMATION FOR SEQ ID NO:139:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #139: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CCCC AYCTAGGCCG RGA                                         - (2) INFORMATION FOR SEQ ID NO:140:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #140: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       AART TRCGCGACCT RCG                                         - (2) INFORMATION FOR SEQ ID NO:141:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #141: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       AAGC CGCAYGTRAG GGT                                         - (2) INFORMATION FOR SEQ ID NO:142:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #142: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       CCGA CGAGCGGWAT RTA                                         - (2) INFORMATION FOR SEQ ID NO:143:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #143: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       GCCA RGGCCCTGGC AGC                                         - (2) INFORMATION FOR SEQ ID NO:144:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #144: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       TTCA CGCCGTCYTC CAG                                         - (2) INFORMATION FOR SEQ ID NO:145:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 33 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #145: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    #         33       AAAG AGCAACCRGG MAR                                         - (2) INFORMATION FOR SEQ ID NO:146:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 20 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #146: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    # 20               TCTT                                                        - (2) INFORMATION FOR SEQ ID NO:147:                                           -      (i) SEQUENCE CHARACTERISTICS:                                                     (A) LENGTH: 20 nucleoti - #des                                                 (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS:  sing - #le                                                  (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE:  DNA                                                 #147: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:                                    # 20               GGTG                                                        __________________________________________________________________________ 

We claim:
 1. A method of forming a hybridization product with a hepatitis C virus nucleic acid comprising the steps of:(a) placing an isolated or synthesized nucleic acid into conditions in which hybridizations conditions can be imposed, said isolated or synthesized nucleic acid comprising a nucleotide sequence of eight or more contiguous nucleotides fully homologous to or fully complementary to a sequence within a non-HCV-1, hepatitis C viral genome, wherein said sequence is selected from the group consisting of SEQ ID NOs 7-12, 14-17, 19-22, 25-32, 35-36, 38-51, and 53-66, and wherein said isolated or synthesized nucleic acid is capable of forming a hybridization product with said hepatitis C virus nucleic acid, if present under hybridization conditions; and (b) imposing hybridization conditions to form a hybridization product in the presence of hepatitis C virus nucleic acid.
 2. The method of claim 1, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence within the NS5 region of the hepatitis C virus genome.
 3. The method of claim 2, wherein said nucleotide sequence is selected from the group consisting of SEQ ID NOs 7-12, 14-17, and 19-22.
 4. The method of claim 1, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence within the envelope 1 region of the HCV genome.
 5. The method of claim 1, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence within the 5'UT region of the HCV genome.
 6. The method of claim 5, wherein said nucleotide sequence is selected from the group consisting of SEQ ID NOs 35, 36, and 38-51.
 7. The method of claim 1, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence within the core region of the HCV genome.
 8. The method of claim 7, wherein said nucleotide sequence is selected from the group consisting of SEQ ID NOs 53-66.
 9. The method according to any of claims 1-7 or 8, wherein said nucleotide sequence is fifteen nucleotides or more.
 10. The method according to any of claims 1-7 or 8, wherein said nucleotide sequence is twenty-four nucleotides or more.
 11. The method of claim 1, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of one or more genotypes of a hepatitis C virus.
 12. The method of claim 11, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of a first genotype of HCV, which first genotype is defined by;SEQ ID NO: 25 in the envelope 1 region of the HCV genome; or any one of SEQ ID NOs 35, 36, 38 in the 5'UT region of the HCV genome; or any one of SEQ ID NOs 53-57 in the core region of the HCV genome.
 13. The method of claim 11, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of a third genotype of HCV, which third genotype is defined by;any one of SEQ ID NOs 14-17 in the NS5 region of the HCV genome; or SEQ ID NO 32 in the envelope 1 region of the HCV genome; or any one of SEQ ID NOs 46-47 in the 5'UT region of the HCV genome; or any one of SEQ ID NOs 65-66 in the core region of the HCV genome.
 14. The method of claim 11, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of a fourth genotype of HCV, which fourth genotype is defined by;any one of SEQ ID NOs 20-22 in the NS5 region of the HCV genome; or any one of SEQ ID NOs 29-31 in the envelope 1 region of the HCV genome; or any one of SEQ ID NOs 48-49 in the 5'UT region of the HCV genome.
 15. The method of claim 11, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of a fifth genotype of HCV, which fifth genotype is defined by;SEQ ID NO: 19 in the NS5 region of the HCV genome; or any one of SEQ ID NOs 50-51 in the 5'UT region of the HCV genome.
 16. The method of claim 1, wherein said hybridization product is capable of priming a reaction for the synthesis of nucleic acid.
 17. The method of claim 1, wherein said non-naturally occurring nucleic acid has label means for detecting a hybridization product.
 18. The method of claim 1, wherein said non-naturally occurring nucleic acid has support means for separating a hybridization product from solution.
 19. The method of claim 2, wherein said non-naturally occurring nucleic acid prevents the transcription or translation of viral nucleic acid.
 20. A method of detecting one or more genotypes of hepatitis C virus comprising the steps of:(a) placing an isolated or synthesized nucleic acid into conditions in which hybridizations conditions can be imposed, said isolated or synthesized nucleic acid comprising a nucleotide sequence of eight or more contiguous nucleotides fully homologous to or fully complementary to a sequence within a non-HCV-1, hepatitis C viral genome, wherein said sequence is selected from the group consisting of SEQ ID NOs 7-12, 14-17, 19-22, 25-32, 35-36, 38-51, and 53-145, and wherein said isolated or synthesized nucleic acid is capable of forming a hybridization product with said hepatitis C virus nucleic acid, if present, under hybridization conditions; (b) imposing hybridization conditions to form a hybridization product in the presence of hepatitis C virus nucleic acid; and (c) monitoring the isolated or synthesized nucleic acid for the formation of a hybridization product, which hybridization product is indicative of the presence of the genotype of hepatitis C virus.
 21. The method of claim 20, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of a first genotype of HCV, which first genotype is defined by;SEQ ID NO: 25 in the envelope 1 region of the HCV genome; or any one of SEQ ID NOs 35-36, 38 in the 5'UT region of the HCV genome; or any one of SEQ ID NOs 53-57 in the core region of the HCV genome.
 22. The method of claim 20, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of a third genotype of HCV, which third genotype is defined by;any one of SEQ ID NOs 14-17 in the NS5 region of the HCV genome; or SEQ ID NO 32 in the envelope 1 region of the HCV genome; or any one of SEQ ID NOs 46-47 in the 5'UT region of the HCV genome; or any one of SEQ ID NOs 65-66 in the core region of the HCV genome.
 23. The method of claim 20, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of a fourth genotype of HCV, which fourth genotype is defined by;any one of SEQ ID NOs 20-22 in the NS5 region of the HCV genome; or any one of SEQ ID NOs 29-31 in the envelope 1 region of the HCV genome; or any one of SEQ ID NOs 48-49 in the 5'UT region of the HCV genome.
 24. The method of claim 20, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence of a fifth genotype of HCV, which fifth genotype is defined by;SEQ ID NO: 19 in the NS5 region of the HCV genome; or any one of SEQ ID NOs 50-51 in the 5'UT region of the HCV genome.
 25. The method of claim 20, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence within SEQ ID NOs 67-145.
 26. The method of claim 20, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence within SEQ ID NOs 69, 71, 73 and 81-20 and detects the presence of Group I genotypes.
 27. The method of claim 20, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence within SEQ ID NO 77 and detects the presence of Group III genotypes.
 28. The method of claim 20, wherein said nucleotide sequence is fully homologous to or fully complementary to a sequence within SEQ ID NO 79 and detects the presence of Group IV genotypes.
 29. A method of detecting Group II genotype isolates of hepatitis C virus comprising the steps of:(a) placing an isolated or synthesized nucleic acid into conditions in which hybridizations conditions can be imposed, said isolated or synthesized nucleic acid comprising a nucleotide sequence of eight or more contiguous nucleotides fully homologous to or fully complementary to a sequence within a non-HCV-1, hepatitis C viral genome, wherein said sequence is selected from the group consisting of SEQ ID NOs 70, 72, 74, and 100-118, and where said isolated or synthesized nucleic acid is capable of forming a hybridization product with said hepatitis C virus nucleic acid, if present, under hybridization conditions; (b) imposing hybridization conditions to form a hybridization product in the presence of hepatitis C virus nucleic acid; and (c) monitoring the isolated or synthesized nucleic acid for the formation of a hybridization product, which hybridization product is indicative of the presence of the Group II genotype of hepatitis C virus.
 30. A method of detecting one or more genotypes of hepatitis C virus comprising the steps of:(a) placing an isolated or synthesized nucleic acid comprising a nucleotide sequence of eight or more contiguous nucleotides fully homologous to or fully complementary to a sequence within a non-HCV-1, hepatitis C viral genome, wherein said sequence is selected from the group consisting of SEQ ID NOs 2-6, 8-22, 24-32, 35-36, 38-51, and 53-66, into conditions in which hybridization conditions can be imposed, wherein said nucleotide sequence is capable of forming a hybridization product with said hepatitis C virus nucleic acid, if present, under hybridization conditions, wherein a first genotype is defined by SEQ ID NOs 25, 35-36, and 53-57; a second genotype is defined by SEQ ID NOs 8-12, 26-28, 39-45, and 58-64; a third genotype is defined by SEQ ID NOs 14-17, 32, 46-47, and 65-66; a fourth genotype is defined by SEQ ID NOs 20-22, 29-31, and 48-49; and a fifth genotype is defined by SEQ ID NO: 19; (b) imposing hybridization conditions to form a hybridization product in the presence of hepatitis C virus nucleic acid; and (c) monitoring the isolated or synthesized nucleic acid for the formation of a hybridization product, which hybridization product is indicative of the presence of the genotype of hepatitis C virus. 